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Preface 


The world is continuous, but the mind is discrete. 


David Mumford 


We seek to bridge some critical gaps between various fields of mathematics by 
studying the interplay between the continuous volume and the discrete vol- 
ume of polytopes. Examples of polytopes in three dimensions include crystals, 
boxes, tetrahedra, and any convex object whose faces are all flat. It is amusing 
to see how many problems in combinatorics, number theory, and many other 
mathematical areas can be recast in the language of polytopes that exist in 
some Euclidean space. Conversely, the versatile structure of polytopes gives 
us number-theoretic and combinatorial information that flows naturally from 
their geometry. 





Fig. 0.1. Continuous and discrete volume. 


The discrete volume of a body P can be described intuitively as the number 
of grid points that lie inside P, given a fixed grid in Euclidean space. The 
continuous volume of P has the usual intuitive meaning of volume that we 
attach to everyday objects we see in the real world. 
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Indeed, the difference between the two realizations of volume can be 
thought of in physical terms as follows. On the one hand, the quantum- 
level grid imposed by the molecular structure of reality gives us a discrete 
notion of space and hence discrete volume. On the other hand, the New- 
tonian notion of continuous space gives us the continuous volume. We see 
things continuously at the Newtonian level, but in practice we often compute 
things discretely at the quantum level. Mathematically, the grid we impose 
in space—corresponding to the grid formed by the atoms that make up an 
object—helps us compute the usual continuous volume in very surprising and 
charming ways, as we shall discover. 

In order to see the continuous/discrete interplay come to life among the 
three fields of combinatorics, number theory, and geometry, we begin our fo- 
cus with the simple-to-state coin-exchange problem of Frobenius. The beauty 
of this concrete problem is that it is easy to grasp, it provides a useful com- 
putational tool, and yet it has most of the ingredients of the deeper theories 
that are developed here. 

In the first chapter, we give detailed formulas that arise naturally from 
the Frobenius coin-exchange problem in order to demonstrate the intercon- 
nections between the three fields mentioned above. The coin-exchange problem 
provides a scaffold for identifying the connections between these fields. In the 
ensuing chapters we shed this scaffolding and focus on the interconnections 
themselves: 


(1) Enumeration of integer points in polyhedra—combinatorics, 
(2) Dedekind sums and finite Fourier series—number theory, 
(3) Polygons and polytopes—geometry. 


We place a strong emphasis on computational techniques, and on com- 
puting volumes by counting integer points using various old and new ideas. 
Thus, the formulas we get should not only be pretty (which they are!) but 
should also allow us to efficiently compute volumes by using some nice func- 
tions. In the very rare instances of mathematical exposition when we have a 
formulation that is both “easy to write” and “quickly computable,” we have 
found a mathematical nugget. We have endeavored to fill this book with such 
mathematical nuggets. 

Much of the material in this book is developed by the reader in the more 
than 200 exercises. Most chapters contain warm-up exercises that do not de- 
pend on the material in the chapter and can be assigned before the chapter 
is read. Some exercises are central, in the sense that current or later material 
depends on them. Those exercises are marked with &, and we give detailed 
hints for them at the end of the book. Most chapters also contain lists of open 
research problems. 

It turns out that even a fifth grader can write an interesting paper on 
integer-point enumeration [145], while the subject lends itself to deep inves- 
tigations that attract the current efforts of leading researchers. Thus, it is an 
area of mathematics that attracts our innocent childhood questions as well 
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as our refined insight and deeper curiosity. The level of study is highly ap- 
propriate for a junior/senior undergraduate course in mathematics. In fact, 
this book is ideally suited to be used for a capstone course. Because the three 
topics outlined above lend themselves to more sophisticated exploration, our 
book has also been used effectively for an introductory graduate course. 

To help the reader fully appreciate the scope of the connections between 
the continuous volume and the discrete volume, we begin the discourse in two 
dimensions, where we can easily draw pictures and quickly experiment. We 
gently introduce the functions we need in higher dimensions (Dedekind sums) 
by looking at the coin-exchange problem geometrically as the discrete volume 
of a generalized triangle, called a simplex. 

The initial techniques are quite simple, essentially nothing more than ex- 
panding rational functions into partial fractions. Thus, the book is easily ac- 
cessible to a student who has completed a standard college calculus and linear 
algebra curriculum. It would be useful to have a basic understanding of par- 
tial fraction expansions, infinite series, open and closed sets in Rf, complex 
numbers (in particular, roots of unity), and modular arithmetic. 

An important computational tool that is harnessed throughout the text is 
the generating function f(x) = )>*°_,a(m)x™, where the a(m)’s form any 
sequence of numbers that we are interested in analyzing. When the infinite 
sequence of numbers a(m),m = 0,1,2,..., is embedded into a single generat- 
ing function f(x), it is often true that for hitherto unforeseen reasons, we can 
rewrite the whole sum f(x) in a surprisingly compact form. It is the rewriting 
of these generating functions that allows us to understand the combinatorics 
of the relevant sequence a(m). For us, the sequence of numbers might be the 
number of ways to partition an integer into given coin denominations, or the 
number of points in an increasingly large body, and so on. Here we find yet 
another example of the interplay between the discrete and the continuous: we 
are given a discrete set of numbers a(m), and we then carry out analysis on 
the generating function f(a) in the continuous variable x. 


What Is the Discrete Volume? 


The physically intuitive description of the discrete volume given above rests 
on a sound mathematical footing as soon as we introduce the notion of a 
lattice. The grid is captured mathematically as the collection of all integer 
points in Euclidean space, namely Z? = {(21,...,@a) : all a, € Z}. This 
discrete collection of equally spaced points is called a lattice. If we are given 
a geometric body P, its discrete volume is simply defined as the number of 
lattice points inside P, that is, the number of elements in the set Z?N P. 
Intuitively, if we shrink the lattice by a factor k and count the number 
of newly shrunken lattice points inside P, we obtain a better approximation 
for the volume of P, relative to the volume of a single cell of the shrunken 
lattice. It turns out that after the lattice is shrunk by an integer factor k, the 
number # (P N izt) of shrunken lattice points inside an integral polytope P 
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is magically a polynomial in k. This counting function # (P fÀ izt) is known 
as the Ehrhart polynomial of P. If we kept shrinking the lattice by taking a 
limit, we would of course end up with the continuous volume that is given by 
the usual Riemannian integral definition of calculus: 


; l a\ 1 

vol P = lim # (Po Ze ) Ea: 

However, pausing at fixed dilations of the lattice gives surprising flexibility 

for the computation of the volume of P and for the number of lattice points 
that are contained in P. 

Thus, when the body P is an integral polytope, the error terms that mea- 
sure the discrepancy between the discrete volume and the usual continuous 
volume are quite nice; they are given by Ehrhart polynomials, and these enu- 
meration polynomials are the content of Chapter 3. 


The Fourier—Dedekind Sums Are the Building Blocks: Number 
Theory 


Every polytope has a discrete volume that is expressible in terms of certain 
finite sums that are known as Dedekind sums. Before giving their definition, we 
first motivate these sums with some examples that illustrate their building- 
block behavior for lattice-point enumeration. To be concrete, consider for 
example a 1-dimensional polytope given by an interval P = [0,a], where a is 
any positive real number. It is clear that we need the greatest integer function 
|x| to help us enumerate the lattice points in P, and indeed the answer is 
la] +1. 

Next, consider a 1-dimensional line segment that is sitting in the 2- 
dimensional plane. Let’s pick our segment P so that it begins at the origin 
and ends at the lattice point (c,d). As becomes apparent after a moment’s 
thought, the number of lattice points on this finite line segment involves an 
old friend, namely the greatest common divisor of c and d. The exact number 
of lattice points on the line segment is ged(c, d) + 1. 

To unify both of these examples, consider a triangle P in the plane whose 
vertices have rational coordinates. It turns out that a certain finite sum is 
completely natural because it simultaneously extends both the greatest integer 
function and the greatest common divisor, although the latter is less obvious. 
An example of a Dedekind sum in two dimensions that arises naturally in the 
formula for the discrete volume of the rational triangle P is the following: 


s,s) E (P-a (ME [28 -2). 


m=1 





The definition makes use of the greatest integer function. Why do these sums 
also resemble the greatest common divisor? Luckily, the Dedekind sums sat- 
isfy a remarkable reciprocity law, quite similar to the Euclidean algorithm 
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that computes the gcd. This reciprocity law allows the Dedekind sums to be 
computed in roughly log(b) steps rather than the b steps that are implied by 
the definition above. The reciprocity law for s(a,b) lies at the heart of some 
amazing number theory that we treat in an elementary fashion, but that also 
comes from the deeper subject of modular forms and other modern tools. 

We find ourselves in the fortunate position of viewing an important tip of 
an enormous mountain of ideas, submerged by the waters of geometry. As we 
delve more deeply into these waters, more and more hidden beauty unfolds 
for us, and the Dedekind sums are an indispensable tool that allow us to see 
further as the waters get deeper. 


The Relevant Solids Are Polytopes: Geometry 


The examples we have used, namely line segments and polygons in the plane, 
are special cases of polytopes in all dimensions. One way to define a polytope 
is to consider the convex hull of a finite collection of points in Euclidean 
space R?. That is, suppose someone gives us a set of points V1,...,Vn in 
R?. The polytope determined by the given points vj is defined by all linear 
combinations c1 V1 +C2V2 +: :+CnVn, where the coefficients cj are nonnegative 
real numbers that satisfy the relation cı +c2+-:-:+¢, = 1. This construction 
is called the vertex description of the polytope. 

There is another equivalent definition, called the hyperplane description 
of the polytope. Namely, if someone hands us the linear inequalities that 
define a finite collection of half-spaces in R, we can define the associated 
polytope as the simultaneous intersection of the half-spaces defined by the 
given inequalities. 

There are some “obvious” facts about polytopes that are intuitively clear 
to most students but are, in fact, subtle and often nontrivial to prove from first 
principles. Two of these facts, namely that every polytope has both a vertex 
and a hyperplane description, and that every polytope can be triangulated, 
form a crucial basis to the material we will develop in this book. We carefully 
prove both facts in the appendices. The two main statements in the appen- 
dices are intuitively clear, so that novices can skip over their proofs without 
any detriment to their ability to compute continuous and discrete volumes of 
polytopes. All theorems in the text (including those in the appendices) are 
proved from first principles, with the exception of the last chapter, where we 
assume basic notions from complex analysis. 

The text naturally flows into two parts, which we now explicate. 


Part I 


We have taken great care in making the content of the chapters flow seamlessly 
from one to the next, over the span of the first six chapters. 


e Chapters 1 and 2 introduce some basic notions of generating functions, in 
the visually compelling context of discrete geometry, with an abundance 
of detailed motivating examples. 
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Fig. 0.2. The partially ordered set of chapter dependencies. 


e Chapters 3, 4, and 5 develop the full Ehrhart theory of discrete volumes 
of rational polytopes. 

e Chapter 6 is a “dessert” chapter, in that it enables us to use the theory 
developed to treat the enumeration of magic squares, an ancient topic that 
enjoys active current research. 


Part II 


We now begin anew. 


e Having attained experience with numerous examples and results about in- 
teger polytopes, we are ready to learn about the Dedekind sums of Chap- 
ter 8, which form the atomic pieces of the discrete volume polynomials. On 
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the other hand, to fully understand Dedekind sums, we need to understand 
finite Fourier analysis, which we therefore develop from first principles in 
Chapter 7, using only partial fractions. 

e Chapter 9 answers a simple yet tricky question: how does the finite geo- 
metric series in one dimension extend to higher-dimensional polytopes? 
Brion’s theorem give the elegant and decisive answer to this question. 

e Chapter 10 extends the interplay between the continuous volume and the 
discrete volume of a polytope (already studied in detail in Part I) by 
introducing Euler-Maclaurin summation formulas in all dimensions. These 
formulas compare the continuous Fourier transform of a polytope to its 
discrete Fourier transform, yet the material is completely self-contained. 

e Chapter 11 develops an exciting extension of Ehrhart theory that defines 
and studies the solid angles of a polytope; these are the natural extensions 
of 2-dimensional angles to higher dimensions. 

e Finally, we end with another “dessert” chapter that uses complex ana- 
lytic methods to find an integral formula for the discrepancy between the 
discrete and continuous areas enclosed by a closed curve in the plane. 


Because polytopes are both theoretically useful (in triangulated manifolds, 
for example) and practically essential (in computer graphics, for example) we 
use them to link results in number theory and combinatorics. There are many 
research papers being written on these interconnections, even as we speak, 
and it is impossible to capture them all here; however, we hope that these 
modest beginnings will give the reader who is unfamiliar with these fields a 
good sense of their beauty, inexorable connectedness, and utility. We have 
written a gentle invitation to what we consider a gorgeous world of counting 
and of links between the fields of combinatorics, number theory, and geometry 
for the general mathematical reader. 

There are a number of excellent books that have a nontrivial intersec- 
tion with ours and contain material that complements the topics discussed 
here. We heartily recommend the monographs of Barvinok [12] (on general 
convexity topics), Ehrhart [81] (the historic introduction to Ehrhart theory), 
Ewald [82] (on connections to algebraic geometry), Hibi [96] (on the interplay 
of algebraic combinatorics with polytopes), Miller-Sturmfels [132] (on com- 
putational commutative algebra), and Stanley [172] (on general enumerative 
problems in combinatorics). 
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The Coin-Exchange Problem of Frobenius 


The full beauty of the subject of generating functions emerges only from tuning in 
on both channels: the discrete and the continuous. 


Herbert Wilf [187] 


Suppose we’re interested in an infinite sequence of numbers (ax); that arises 
geometrically or recursively. Is there a “good formula” for a, as a function of 
k? Are there identities involving various a;,’s? Embedding this sequence into 
the generating function 


F(z) = S- ax 2* 


k>0 


allows us to retrieve answers to the questions above in a surprisingly quick 
and elegant way. We can think of F(z) as lifting our sequence a, from its 
discrete setting into the continuous world of functions. 


1.1 Why Use Generating Functions? 


To illustrate these concepts, we warm up with the classic example of the 
Fibonacci sequence fk, named after Leonardo Pisano Fibonacci (1170- 
12507)! and defined by the recursion 


fo = 9, fi =1, and fk+2 = fk+1 + fk for k>0. 


This gives the sequence (fk) _o = (0,1,1, 2,3, 5,8, 13, 21,34,...) (see also 
[165, Sequence A000045]). Now let’s see what generating functions can do for 
us. Let 


1 For more information about Fibonacci, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Fibonacci.html. 


4 1 The Coin-Exchange Problem of Frobenius 
F(z) = 5 fez". 
k>0 
We embed both sides of the recursion identity into their generating functions: 
ieee” = ath) = a A fe AD 
k>0 k>0 k>0 k>0 
The left-hand side of (1.1) is 
1 1 1 
ko k4+2 _ k 
X fe" = olaa = z% h? = -z (F) - 2), 
k>0 k>0 k>2 
while the right-hand side of (1.1) is 
1 
So fent +Y fez" = —F(z)+ F(z). 
z 
k>0 k>0 


So (1.1) can be restated as 





= (Fe) -2) = SF) + FO), 
or y 
a ae er 


It’s fun to check (e.g., with a computer) that when we expand the function F' 
into a power series, we indeed obtain the Fibonacci numbers as coefficients: 


Ft 22 4978 432445254 87941327 4-21 28 4.34294... 








1—2z— 2? 


Now we use our favorite method of handling rational functions: the par- 
tial fraction expansion. In our case, the denominator factors as 1 — z — z? = 
(1 14/5 z) (1 1-5 a) and the partial fraction expansion is (see Exer- 


cise 1.1) 





a E E E UNS (1.2) 
lezz % 1+5 z 1 AAS z l 








The two terms suggest the use of the geometric series 


1 
oe E (1.3) 


k>0 





(see Exercise 1.2) with x = 14-6 z and z = 1-75 z, respectively: 


z 1 1+v5 i 1 1- v5 i 
mmia) aE (5) 


k>0 
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Comparing the coefficients of z* in the definition of F(z) = ops fk z" and 
the new expression above for F(z), we discover the closed form expression for 
the Fibonacci sequence 


_\ k k 
Paca v5 1 f1- v5 
HJN 2 wah 2 
This method of decomposing a rational generating function into partial 
fractions is one of our key tools. Because we will use partial fractions time 


and again throughout this book, we record the result on which this method 
is based. 





Theorem 1.1 (Partial fraction expansion). Given any rational function 


n p(z) 
F(z) := lees (z = Gx) ’ 


where p is a polynomial of degree less than e1 +€2+---+€m and the ax’s are 
distinct, there exists a decomposition 


m 
Ck,1 Ck,2 Ck, 
ro-5 (2 pms a] 


2 
k=1 (z — ax) 





where cz; E C are unique. 


One possible proof of this theorem is based on the fact that the polynomials 
form a Euclidean domain. For readers who are acquainted with this notion, 
we outline this proof in Exercise 1.35. 


1.2 Two Coins 


Let’s imagine that we introduce a new coin system. Instead of using pennies, 
nickels, dimes, and quarters, let’s say we agree on using 4-cent, 7-cent, 9-cent, 
and 34-cent coins. The reader might point out the following flaw of this new 
system: certain amounts cannot be changed (that is, created with the available 
coins), for example, 2 or 5 cents. On the other hand, this deficiency makes our 
new coin system more interesting than the old one, because we can ask the 
question, “which amounts can be changed?” In fact, we will prove in Exercise 
1.20 that there are only finitely many integer amounts that cannot be changed 
using our new coin system. A natural question, first tackled by Ferdinand 
Georg Frobenius (1849-1917),? and James Joseph Sylvester (1814-1897)? is, 


2 For more information about Frobenius, see 
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Frobenius.html. 

3 For more information about Sylvester, see 
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Sylvester.html. 
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“what is the largest amount that cannot be changed?” As mathematicians, 
we like to keep questions as general as possible, and so we ask, given coins of 


denominations a1, a2,...,@qa, which are positive integers without any common 
factor, can you give a formula for the largest amount that cannot be changed 
using the coins a1, d@2,...,aq? This problem is known as the Frobenius coin- 


exchange problem. 
To be precise, suppose we’re given a set of positive integers 


A= {a1,a2,...,@a} 


with gcd (a1, a2,...,@a) = 1 and we call an integer n representable if there 
exist nonnegative integers m1,mg,...,7™mq such that 


n = M101 +++: + Madad. 


In the language of coins, this means that we can change the amount n us- 
ing the coins a1,a2,...,@a. The Frobenius problem (often called the linear 
Diophantine problem of Frobenius) asks us to find the largest integer that is 
not representable. We call this largest integer the Frobenius number and 
denote it by g(a1,...,@a). The following theorem gives us a pretty formula 
for d = 2. 


Theorem 1.2. [f a, and az are relatively prime positive integers, then 
g (a1, 42) = a102 — ay — a2. 


This simple-looking formula for g inspired a great deal of research into 
formulas for g (a1, @2,...,@q) with only limited success; see the notes at the 
end of this chapter. For d = 2, Sylvester gave the following result. 


Theorem 1.3 (Sylvester’s theorem). Let a; and ag be relatively prime 
positive integers. Exactly half of the integers between 1 and (a, — 1) (a2 — 1) 
are representable. 


Our goal in this chapter is to prove these two theorems (and a little more) 
using the machinery of partial fractions. We approach the Frobenius problem 
through the study of the restricted partition function 


pa(n) = # { (m... Mma) E€ Z° : all m; > 0, miar +++++maaa =n}, 


the number of partitions of n using only the elements of A as parts. In 
view of this partition function, g(a1,...,@q) is the largest integer n for which 
pa(n) = 0. 

There is a beautiful geometric interpretation of the restricted partition 
function. The geometric description begins with the set 


4 A partition of a positive integer n is a multiset (i.e., a set in which we allow 
repetition) {n1, n2, ..., Nk} of positive integers such that n = nı + n2 +- +nk. 
The numbers n1, n2,..., Npk are called the parts of the partition. 


7 


1.3 Partial Fractions and a Surprising Formula 
,tq) ERÊ: all x; = 0, £10, +: + raaa = 1}. (1.4) 


P = { (x1, ae 
The n* dilate of any set S C Rê is 


.;m&q): (£1,..., £a) E S}. 


{(nz1, nz2,.. 

The function p(n) counts precisely those integer points that lie in the nt? 
integer dilate of the body P. The dilation process in this context is tantamount 
to replacing £1041 +- -*+£q4aa = 1 in the definition of P by x1a,+---+xqaq =N. 
The set P turns out to be a polytope. We can easily picture P and its dilates 


for dimension d < 3; Figure 1.1 shows the three-dimensional case. 


-7o 


oR 








ela 


ale 


Fig. 1.1. d=3. 


1.3 Partial Fractions and a Surprising Formula 


We first concentrate on the case d = 2 and study 
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Pta b} (n) = # {(k, 1) €Z?:k,l>0, ak+bl= n} ; 


Recall that we require a and b to be relatively prime. To begin our discus- 
sion, we start playing with generating functions. Consider the product of the 
following two geometric series: 


1 1 


(see Exercise 1.2). If we multiply out all the terms we’ll get a power series all 
of whose exponents are linear combinations of a and b. In fact, the coefficient 
of z” in this power series counts the number of ways that n can be written as 
a nonnegative linear combination of a and b. In other words, these coefficients 
are precisely evaluations of our counting function Pa, b}: 


(5) (a) < EV = puno) 


k>0 1>0 n>0 








So this function is the generating function for the sequence of integers 
(Pta b(n n)) . The idea is now to study the compact function on the left. 

We would ‘like to uncover an interesting formula for prab} (n) by looking at 
the generating function on the left more closely. To make our computational 
life easier, we study the constant term of a related series; namely, P{a,b}(n) is 
the constant term of 


f(z) = EFONTS =X piao} (k) soe 


k>0 





The latter series is not quite a power series, since it includes terms with neg- 
ative exponents. These series are called Laurent series, after Pierre Alphonse 
Laurent (1813-1854). For a power series (centered at 0), we could simply eval- 
uate the corresponding function at z = 0 to obtain the constant term; once 
we have negative exponents, such an evaluation is not possible. However, if 
we first subtract all terms with negative exponents, we’ll get a power series 
whose constant term (which remains unchanged) can now be computed by 
evaluating this remaining function at z = 0. 

To be able to compute this constant term, we will expand f into partial 
fractions. As a warm-up to partial fraction decompositions, we first work out 
a one-dimensional example. Let’s denote the first atè root of unity by 


2 2 
bq = TA = cos £ + isin = ; 
a a 
then all the at? roots of unity are 1, £a, €2,€8,...,€271. 
Example 1.4. Let’s find the pee fraction expansion of jz. The poles of 
this function are located at all at? roots of unity €* for k = 0, 1 ...,@a — 1. So 


we expand 
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1 —~ Ch 
1-22 = z— Ek 
k=0 
How do we find the coefficients C? Well 
1 1 ES 
= (2 — €a) (=) 2 -—aze! a’ 


where we have used DL’Hôpital’s rule in the penultimate equality. Therefore, 
we arrive at the expansion 


a—1 
1 1 eh 
1-24 pore 


k=0 

















Returning to restricted partitions, the poles of f are located at z = 0 with 
multiplicity n, at z = 1 with multiplicity 2, and at all the other a"? and bt? 
roots of unity with multiplicity 1 because a and b are relatively prime. Hence 
our partial fraction expansion looks like 


A AS A, B = 


as eee ee ee F T EE +> n (1.5) 
a j=l i b 





We invite the reader to compute the coefficients (Exercise 1.21) 








1 
Ch = (1.6) 
G(r Een 
D; = : 


b(1 = F set) ` 


To compute Bz, we multiply both sides of (1.5) by (z — 1)? and take the limit 
as z — 1 to obtain 
(z- 1) 1 


B = li = 
a eae (1—2%)(1—24)z" ab’ 





by applying L’Hopital’s rule twice, for example. For the more interesting con- 
stant B1, we compute 


B, = lim (z-1) : ab ee ee 
= vA = 
bayaa (1—29) (1 — 2) 2 (z— 1)? ab 2a 2b ab’ 


again by applying L’Hopital’s rule. 

We don’t need to compute the coefficients A1,..., An, since they con- 
tribute only to the terms with negative exponents, which we can safely ne- 
glect; these terms do not contribute to the constant term of f. Once we have 





10 1 The Coin-Exchange Problem of Frobenius 


the other coefficients, the constant term of the Laurent series of f is—as we 
said above—the following function evaluated at 0: 


By Bo, Ce Dj 
Pta b} (n) = beh Gate pier: oe j 











K z=0 
-1 b-1 
C D 
=-Bı+B:-% -5 L 
k=1 Sa j=1 & 
With (1.6) in hand, this simplifies to 
a-1 b-1 
1 1 no 1l 1 1 1 
= | nay ce 
Paw (n) = 55 + 35 abt a em t LY GaP ae) 


Encouraged by this initial success, we now proceed to analyze each sum in 
(1.7) with the hope of recognizing them as more familiar objects. 

For the next step we need to define the greatest-integer function |z], 
which denotes the greatest integer less than or equal to x. A close sibling 
to this function is the fractional-part function {x} = x — |x]. To readers 
not familiar with the functions |x| and {x} we recommend working through 
Exercises 1.3-1.5. 

What we’ll do next is study a special case, namely b = 1. This is appealing 
because P{a,1} (7) simply counts integer points in an interval: 


Pray (0) = # {(k, 1) EZ: k,l > 0, ak+l=n} 
=#{kEZ: k20, ak<n} 
n 
= = < Án 
#{kez o<k<?} 


-jan 


(See Exercise 1.3.) On the other hand, in (1.7) we just computed a different 
expression for this function, so that 


a-—1 

1 1 n 1 1 n 

2a | 2 ! a ! a E = Ptan(n) = lee 
k=1 cies 





With the help of the fractional-part function {z} = x — |x], we have derived 
a formula for the following sum over a‘ roots of unity: 





Ie 1 : i : 
a eR = Vee oon (1.8) 


We’re almost there: we invite the reader (Exercise 1.22) to show that 
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1 1 OIS 1 M 
a Ue ae (1.9) 


where b~! is an integer such that b~-'b = 1 mod a, and to conclude that 


= 1 bin i a i 
a2 eee T { a ae 2a (110) 


k= 








Now all that’s left to do is to substitute this expression back into (1.7), which 
yields the following beautiful formula due to Tiberiu Popoviciu (1906-1975). 


Theorem 1.5 (Popoviciu’s theorem). Ifa and b are relatively prime, then 


n bin atn 
— 1 
root) = {Afb an, 


where b™1b = 1 mod a and a™ta = 1 mod b. 

















1.4 Sylvester’s Result 


Before we apply Theorem 1.5 to obtain the classical Theorems 1.2 and 1.3, we 
return for a moment to the geometry behind the restricted partition function 
P{a,b} (n). In the two-dimensional case (which is the setting of Theorem 1.5), 
we are counting integer points (x,y) € Z? on the line segments defined by the 
constraints 

ax + by =n, z,y> 0. 


As n increases, the line segment gets dilated. It is not too far-fetched (al- 
though Exercise 1.13 teaches us to be careful with such statements) to expect 
that the likelihood for an integer point to lie on the line segment increases 
with n. In fact, one might even guess that the number of points on the line 
segment increases linearly with n, since the line segment is a one-dimensional 
object. Theorem 1.5 quantifies the previous statement in a very precise form: 
P{a b} (n) has the “leading term” n/ab, and the remaining terms are bounded 
as functions in n. Figure 1.2 shows the geometry behind the counting func- 
tion pr4,7}(n) for the first few values of n. Note that the thick line segment 
for n = 17 = 4-7—4-—7 is the last one that does not contain any integer 
point. 


Lemma 1.6. Ifa and b are relatively prime positive integers andn € [1, ab—1] 
is not a multiple of a or b, then 


P{a,b} (n) + PLa,b} (ab = n) =1. 


In other words, for n between 1 and ab— 1 and not divisible by a or b, exactly 
one of the two integers n and ab — n is representable in terms of a and b. 
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°c 
> 


ot 








p 


oe 
pi 
N 


3 4 5 6 7 8 9 x 


Fig. 1.2. 4z + Ty =n, n=1,2,... 


Proof. This identity follows directly from Theorem 1.5: 


mi ps pa) poih 


a 
n —b in —a7'n 
ace ae 
(+) -z a {=}, fo} 
ab a b 


=1- P{a,b} (n) : 








II 
© 








Here, (x) follows from the fact that {—x} = 1— {x} if x ¢ Z (see Exercise 1.5). 














Proof of Theorem 1.2. We have to show that pta b} (ab — a — b) = 0 and that 
Pta b} (n) > 0 for every n > ab — a — b. The first assertion follows with Exer- 
cise 1.24, which states that P{a p} (a +b) = 1, and Lemma 1.6. To prove the 
second assertion, we note that for any integer m, {m=} <1- 4. Hence for any 
positive integer n, 


ab—a—b+n 1 1 n 
i > = — P 
P{a,p}(ab—a—b+n) > a (1 =) (1 ;) + po 

















Proof of Theorem 1.8. Recall that Lemma 1.6 states that for n between 1 and 
ab—1 and not divisible by a or b, exactly one of n and ab—n is representable. 
There are 

ab—a—b+1=(a—1)(b-1) 
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integers between 1 and ab — 1 that are not divisible by a or b. Finally, we 
note that pra4}(n) > 0 if n is a multiple of a or b, by the very definition of 
P{a,b} (n). Hence the number of nonrepresentable integers is $(a—1)(b—1). 














Note that we have proved even more. Essentially by Lemma 1.6, every 
positive integer less than ab has at most one representation. Hence, the rep- 
resentable integers less than ab are uniquely representable (see also Exer- 
cise 1.25). 


1.5 Three and More Coins 


What happens to the complexity of the Frobenius problem if we have more 
than two coins? Let’s go back to our restricted partition function 


n) = #{(m1,...,ma) € Zf: all m; > 0, mya, + + maaa =n}, 


where A = {a1,..., aa}. By the very same reasoning as in Section 1.3, we can 
easily write down the generating function for p4 (n): 


Smo = e (be): 


We use the same methods that were exploited in Section 1.3 to recover our 
function p4 (n) as the constant term of a useful generating function. Namely, 








1 
pa(n) = const (z = za) (1 _ z2)... (1 = =F) . 


We now expand the function on the right into partial fractions. For reasons 











of simplicity we assume in the following that a1,...,aq are pairwise relatively 
prime; that is, no two of the integers a,,a2,...,a@q have a common factor. 
Then our partial fraction expansion looks like 
1 
i= (1 —21)---(1— gaa) gn 
SPE Nae Ok SE a ica 
z z2 z” z—1 (z-1)? (z— 1) 


-1 


ai—1 a2 ag—1 
P D E ES i a > n 
k=1 











ae a 


By now we’re experienced in computing partial fraction coefficients, so that 
the reader will easily verify that (Exercise 1.29) 


1 


a (1 = i) (1 E g) > (1- a gh(n- 1)° 





Cik = (1.12) 
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As before, we don’t have to compute the coefficients A,,...,A,, because they 
don’t contribute to the constant term of f. For the computation of B,,..., Ba, 
we may use a symbolic manipulation program such as Maple or Mathematica. 
Again, once we have calculated these coefficients, we can compute the constant 
term of f by dropping all negative exponents and evaluating the remaining 
function at 0: 




















ai—1 aq—1 
Bi. _ Ba œ Cir Cak 
pate) = (terga D t 5 ae 
k=1 ar k=1 a) |z=0 
aı—1 C a2—1 C ag—1 C 
= —B, + By—---+(-1)*Ba 5 a 5 a vee 5 a 
k=1 °@1 k=1 >2 k=1 Saad 


Substituting the expression we found for Cy, into the latter sum over the 
nontrivial at? roots of unity, for example, gives rise to 


1 D 1 
M et (1- w (1- ie) (1- i pn 


This motivates the definition of the Fourier-Dedekind sum 





kn 


b—1 
Sn (Q1,G2,.--,Qm;b) := 2 Ge ka) E DENE pn) (1.13) 


Sb 





We will study these sums in detail in Chapter 8. With this definition, we have 
arrived at the following result. 


Theorem 1.7. The restricted partition function for A = {a1,a2,...,aa}, 
where the ag’s are pairwise relatively prime, can be computed as 








pa(n) = —B, + Bo —---4+( 1)" Ba + S-n (a2,43,. - . , ad; a1) 
+ S-n (1,03, a4, . . - , ad;a2) +++ + S-n (01,02, . , @d—1; aa) - 
Here Bı, B2,..., Bq are the partial fraction coefficients in the expansion 
(1.11). 











Example 1.8. We give the restricted partition functions for d = 3 and 4. 
These closed-form formulas have proven useful in the refined analysis of the 
periodicity that is inherent in the restricted partition function pa(n). For 
example, one can visualize the graph of P{a,b,c} (n) as a “wavy parabola,” as 
its formula plainly shows. 





ener F.2d te vet SS ge haste 
Pade} 9 che 2 \ab ac bec) 12\a b c' be ac ab 
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Notes 


1. The theory of generating functions has a long and powerful tradition. We 
only touch on its utility. For those readers who would like to dig a little 
deeper into the vast generating-function garden, we strongly recommend Herb 
Wilf’s generatingfunctionology [187] and László Lovasz’s Combinatorial Prob- 
lems and Exercises [122]. The reader might wonder why we do not stress 
convergence aspects of the generating functions we play with. Almost all of 
our series are geometric series and have trivial convergence properties. In the 
spirit of not muddying the waters of lucid mathematical exposition, we omit 
such convergence details. 


2. The Frobenius problem is named after Georg Frobenius, who apparently 
liked to raise this problem in his lectures [41]. Theorem 1.2 is one of the 
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famous folklore results and might be one of the most misquoted theorems 
in all of mathematics. People usually cite James J. Sylvester’s problem in 
[177], but his paper contains Theorem 1.3 rather than 1.2. In fact, Sylvester’s 
problem had previously appeared as a theorem in [176]. It is not known who 
first discovered or proved Theorem 1.2. It is very conceivable that Sylvester 
knew about it when he came up with Theorem 1.3. 


3. The linear Diophantine problem of Frobenius should not be confused with 
the postage-stamp problem. The latter problem asks for a similar determina- 
tion, but adds an additional independent bound on the size of the integer 
solutions to the linear equation. 


4. Theorem 1.5 has an interesting history. The earliest appearance of this 
result that we are aware of is in a paper by Tiberiu Popoviciu [148]. Popoviciu’s 
formula has since been resurrected at least twice [161, 183]. 


5. Fourier-Dedekind sums first surfaced implicitly in Sylvester’s work (see, 
e.g., [175]) and explicitly in connection with restricted partition functions 
in [104]. They were rediscovered in [25], in connection with the Frobenius 
problem. The papers [157, 83] contain interesting connections to Bernoulli 
and Euler polynomials. We will resume the study of the Fourier-Dedekind 
sums in Chapter 8. 


6. As we already mentioned above, the Frobenius problem for d > 3 is much 
harder than the case d = 2 that we have discussed. Certainly beyond d = 3, 
the Frobenius problem is wide open, though much effort has been put into its 
study. The literature on the Frobenius problem is vast, and there is still much 
room for improvement. The interested reader might consult the comprehensive 
monograph [153], which surveys the references to almost all articles dealing 
with the Frobenius problem and gives about 40 open problems and conjectures 
related to the Frobenius problem. To give a flavor, we mention two landmark 
results that go beyond d = 2. 

The first one concerns the generating function r(z) := Ð per2”, where 
R is the set of all integers representable by a given set of relatively prime 
positive integers a),d@2,...,@a. It is not hard to see (Exercise 1.34) that 
r(z) = p(z)/ (1 — 2%) (1 — 2%)--+ (1 — 2%) for some polynomial p. This ra- 
tional generating function contains all the information about the Frobenius 
problem; for example, the Frobenius number is the total degree of the function 
<4 — r(z). Hence the Frobenius problem reduces to finding the polynomial 
p, the numerator of r. Marcel Morales [134, 135] and Graham Denham [73] 
discovered the remarkable fact that for d = 3, the polynomial p has either 4 
or 6 terms. Moreover, they gave semi-explicit formulas for p. The Morales- 
Denham theorem implies that the Frobenius number in the case d = 3 is 
quickly computable, a result that is originally due, in various disguises, to 
Jiirgen Herzog [95], Harold Greenberg [89], and J. Leslie Davison [65]. As 
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much as there seems to be a well-defined border between the cases d = 2 
and d = 3, there also seems to be such a border between the cases d = 3 
and d = 4: Henrik Bresinsky [43] proved that for d > 4, there is no absolute 
bound for the number of terms in the numerator p, in sharp contrast to the 
Morales—Denham theorem. 

On the other hand, Alexander Barvinok and Kevin Woods [14] proved 
that for fixed d, the rational generating function r(z) can be written as a 
“short” sum of rational functions; in particular, r can be efficiently computed 
when d is fixed. A corollary of this fact is that the Frobenius number can 
be efficiently computed when d is fixed; this theorem is due to Ravi Kannan 
[105]. On the other hand, Jorge Ramirez-Alfonsin [152] proved that trying to 
efficiently compute the Frobenius number is hopeless if d is left as a variable. 

While the above results settle the theoretical complexity of the computa- 
tion of the Frobenius number, practical algorithms are a completely different 
matter. Both Kannan’s and Barvinok—Woods’s ideas seem complex enough 
that nobody has yet tried to implement them. Currently, the fastest algo- 
rithm is presented in [32]. 


Exercises 


1.1. & Check the partial fraction expansion (1.2): 


z 1/v5 1/v5 


l-z—-22 1 5,3 1 eg 








1.2. & Suppose z is a complex number, and n is a positive integer. Show that 





(=a) (142+ tee Sao, 
and use this to prove that if |z| < 1, 


1 
aaa 


k>0 





1.3. & Find a formula for the number of lattice points in [a,b] for arbitrary 
real numbers a and b. 


1.4. Prove the following. Unless stated differently, n € Z and x,y E€ R. 
(a) [z +n] = |z] +n. 
(b) læ] + [y] < le +y] < le] + ly] +1. 
etal a 0 ifxeZ, 
) 
) 





—1 otherwise. 
(d) For n € Zso, | | =|ž]. 


n n 
(e) —|—a] is the least integer greater than or equal to x, denoted by [x]. 
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(£) |x + 1/2] is the nearest integer to x (and if two integers are equally near 
to x, it is the larger of the two). 

(g) le] + le +1/2) = [22]. 

(h) If m and n are positive integers, | = | is the number of integers among 
1,...,m that are divisible by n. 

(i) & If m € Zyo,n € Z, then |==] = —|="|-1. 

(j) #& If m € Z>0,n € Z, then |"—*| + 1 is the least integer greater than or 
equal to n/m. 





1.5. Rewrite in terms of the fractional-part function as many of the above 
identities as you can make sense of. 


1.6. Suppose m and n are relatively prime positive integers. Prove that 


` =| -3 =| = 5(m—1)(n-1), 


" n 
k=0 j=0 





1.7. Prove the following identities. They will become handy at least twice: 
when we study partial fractions, and when we discuss finite Fourier series. For 
b, y E Rn E€ Zy0, me Z, 


(a) et = 1, 
(b) ei? ei? — ett), 
(c) 1/el# =e“ 
(d) ettm) = ett, 
(e) e™ =1, 
(£) let] = , 
(g) e? = iett, 
n-1 2rikm/jn_ Jn if nim, 
(h) X k= € E { 0 otherwise, 
š n-1 tik /n n 
(i) yay kert = 


1.8. Suppose m,n € Z and n > 0. Find a closed form for 375 Set ernikm/n 
(as a function of m and n). 


1.9. & Suppose m and n are relatively prime integers, and n is positive. Show 
that 


jena :0<k< n} = E :0<j< n} 


and 
VRE -O<ke n} = some : 0 <J < n} f 


Conclude that if f is any complex-valued function, then 


n—1 n-1 
mimk/n\ — nij/n 
2 f (c? ) 2 f (e J ) 
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and 
Zi 


n—1 
nmimk/n\ — mig (nr ; 
eee ee) 


k=1 


3 


1.10. Suppose n is a positive integer. If you know what a group is, prove 
that the set {e?™*/" : 0 <k <n} forms a cyclic group of order n (under 
multiplication in C). 


1.11. Fix n € Zyo. For an integer m, let (m mod n) denote the least nonneg- 
ative integer in G1 := Zn to which m is congruent. Let’s denote by x addition 
modulo n, and by o the following composition: 


a 
n nS o n i 
defined on the set Gg := TES; mE Z}. Define the following functions: 


ġ((m mod n)) = e?7™/”, 
Y (en — m , 
X ({=}) = (m mod n). 


Prove the following: 
@((m, mod n) x (mz mod n)) = ¢ ((mı mod n)) ¢((mz mod n)), 
w Cai racks =% (rae) a w Comes ; 


My ms, My mM 
a RN AN 
n n n n 
Prove that the three maps defined above, namely ¢, Y, and x, are one-to-one. 
Again, for the reader who is familiar with the notion of a group, let G3 be 
the group of nt! roots of unity. What we have shown is that the three groups 


G1, G2, and G3 are all isomorphic. It is very useful to cycle among these three 
isomorphic groups. 


1.12. & Given integers a, b, c, d, form the line segment in R? joining the point 
(a,b) to (c,d). Show that the number of integer points on this line segment is 
gcd(a — c,b — d) +1. 


1.13. Give an example of a line with 


(a) no lattice point; 
(b) one lattice point; 
(c) an infinite number of lattice points. 


In each case, state—if appropriate—necessary conditions about the (ir)rationa- 
lity of the slope. 
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1.14. Suppose a line y = mx +b passes through the lattice points (p;, q,) and 
(p2, q2). Prove that it also passes through the lattice points 


(pı + k(p2 — pı) qı + kla —M1)), KE Z. 


1.15. Given positive irrational numbers p and q with 3 + i = 1, show that 
Z>o is the disjoint union of the two integer sequences {|pn] : n € Z>o} and 
{lan] : n € Zyo}. This theorem from 1894 is due to Lord Rayleigh and was 
rediscovered in 1926 by Sam Beatty. Sequences of the form {|pn] : n € Zso} 
are often called Beatty sequences. 


1.16. Let a,b,c,d € Z. We say that {(a,b),(c,d)} is a lattice basis of Z? if 
any lattice point (m,n) € Z? can be written as 


(m,n) = p (a,b) +q (c, d) 


for some p,q € Z. Prove that if {(a,b),(c,d)} and {(e, f),(g, h)} are lattice 
bases of Z? then there exists an integer matrix M with determinant +1 such 


that i f 
a e 
C 


Conclude that the determinant of ( J is +1. 





1.17. & Prove that a triangle with vertices on the integer lattice has no other 
interior/boundary lattice points if and only if it has area 4. (Hint: You may 
begin by “doubling” the triangle to form a parallelogram.) 





2° 


1.18. Let’s define a northeast lattice path as a path through lattice points that 
uses only the steps (1,0) and (0,1). Let Ln be the line defined by x + 2y = n. 
Prove that the number of northeast lattice paths from the origin to a lattice 
point on Lp is the (n + 1)*® Fibonacci number fn+1. 


1.19. Compute the coefficients of the Taylor series of 1/(1 — z)? expanded at 
z=0 

(a) by a counting argument, 

(b) by differentiating the geometric series. 


Generalize. 
1.20. & Prove that if a1, a2,...,a@¢ E Z>o do not have a common factor then 
the Frobenius number g(a1,...,@a) is well defined. 


1.21. & Compute the partial fraction coefficients (1.6). 
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1.22. & Prove (1.9): For relatively prime positive integers a and b, 


Hee a-1 


1 1 
ay UE = 


a f= ( k=1 





where b-'b = 1 mod a, and deduce from this (1.10), namely, 


Le 1 _ fbn a 1 
a¢ (1 — 0k) gkn a 2 2a” 


(Hint: Use Exercise 1.9.) 





iM 


1.23. Prove that for relatively prime positive integers a and b, 


Pia} (n + ab) = Prap} (n) +1. 


1.24. & Show that if a and b are relatively prime positive integers, then 


PLa,b} (a + b) =f 


1.25. To extend the Frobenius problem, let us call an integer n k-representable 
if pa (n) = k; that is, n can be represented in exactly k ways using the integers 
in the set A. Define gk = gk(a1,...,aq) to be the largest k-representable 
integer. Prove: 


(a) Let d = 2. For any k € Zo there is an N such that all integers larger than 
N have at least k representations (and hence g;(a,b) is well defined). 

(b) g(a, b) = (k + 1)ab — a — b. 

(c) Given k > 2, the smallest k-representable integer is ab(k — 1). 

(d) The smallest interval containing all uniquely representable integers is 
[min(a, b), gı (a, b)]. 

(e) Given k > 2, the smallest interval containing all k-representable integers 
is [g—2(a,b) + a + b, g(a, b)]. 

(£) There are exactly ab — 1 integers that are uniquely representable. Given 
k > 2, there are exactly ab k-representable integers. 

(g) Extend all of this to d > 3 (see open problems). 


1.26. Find a formula for pta} (n). 


1.27. Prove the following recursion formula: 


m>0 


(Here we use the convention that pa (n) = 0 if n < 0.) Use it in the case d = 2 
to give an alternative proof of Theorem 1.2. 
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1.28. Prove the following extension of Theorem 1.5: Suppose gcd(a,b) = d. 


Then 
p (n) = a {a} E +1 ifd|n, 
a,b z 
ee 0 otherwise, 





where 8 è = 1 mod §, and a § = 1 mod b, 
1.29. & Compute the partial fraction coefficient (1.12). 
1.30. Find a formula for P{a,b,c} (n) for the case gcd(a, b,c) # 1. 
1.31. & With A = {a,,a2,...,aaq} C Zyo, let 
py(n) = # {(mi,...,ma) E€ Z? : all mj > 0, mia, +--+ + maaa =n}; 


that is, p4(n) counts the number of partitions of n using only the elements 
of A as parts, where each part is used at least once. Find formulas for p} for 
A = {a}, A = {a,b}, A = {a,b,c}, A = {a,b,c,d}, where a,b, c, d are pairwise 
relatively prime positive integers. Observe that in all examples, the counting 
functions pa and p% satisfy the algebraic relation 


pal=n) = (-1)**pa(n). 








1.32. Prove that p(n) = pa (n — a, — ag — -+ — aa). (Here, as usual, A = 
{a1,a@2,...,@a}.) Conclude that in the examples of Exercise 1.31 the algebraic 
relation 

pa(—t) = (-1)** pa (t — a — a2 = ++: — aa) 
holds. 


1.33. For relatively prime positive integers a,b, let 
R := {am +bn : m,n € Zso}, 
the set of all integers representable by a and b. Prove that 
y k 1 — z% l 
lege (1 — 2%) (1 — z?) 


Use this rational generating function to give alternative proofs of Theorems 1.2 
and 1.3. 


1.34. For relatively prime positive integers a1, a2,...,@a, let 
R := {mia + Ma2 +--+ + Magda: M1, M2,- .., Ma E Zso}, 
the set of all integers representable by a1, a2,...,aq. Prove that 





oe plz) 
(2) : Ne (1 — 2) (1 — 2%)--- (1 — 244) 


kER 


for some polynomial p. 
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1.35. Prove Theorem 1.1: Given any rational function pyr r) 
=1 


ak where p 


is a polynomial of degree less than e1 + e€2+---+e,, and the a,’s are distinct, 
there exists a decomposition 


m 
5 Ck,1 i Ck,2 i i Ck,ek 
| kiad 
b= Ge (z-a) (z— az)" J? 


k=1 





where the ck j € C are unique. 

Here is an outline of one possible proof. Recall that the set of polynomials 
(over R or C) forms a Euclidean domain, that is, given any two polynomials 
a(z), b(z), there exist polynomials g(z),r(z) with deg(r) < deg(b), such that 


a(z) = W(z)q(z) + r(z). 


Applying this procedure repeatedly (the Euclidean algorithm) gives the great- 
est common divisor of a(z) and b(z) as a linear combination of them, that 
is, there exist polynomials c(z) and d(z) such that a(z)c(z) + b(z)d(z) = 
gcd (a(z), b(z)). 


Step 1: Apply the Euclidean algorithm to show that there exist polynomials 
u1, uz such that 


uz(z) (z — a1)” + u2(2) (z — a2)? = 1. 


Step 2: Deduce that there exist polynomials v1, v2 with deg (vk) < ex such 
that 








(z= a1)" (z-a)? (z-a) (z-a)? 


(Hint: Long division.) 
Step 3: Repeat this procedure to obtain a partial fraction decomposition for 


p(o) o ne) ale) 





Open Problems 


1.36. Come up with a new approach or a new algorithm for the Frobenius 
problem in the d = 4 case. 


1.37. There are a very good lower [65] and several upper bounds [153, Chap- 
ter 3] for the Frobenius number. Come up with improved upper bounds. 


1.38. Solve Vladimir I. Arnold’s Problems 1999-8 through 1999-11 [7]. To give 
a flavor, we mention two of the problems explicitly: 
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(a) Explore the statistics of g (a1, a2,...,@q) for typical large a1, a2,..., dq. It 
is conjectured that g (a1, @2,...,@a) grows asymptotically like a constant 
times 4>/a1d2°*> Gq. 

(b) Determine what fraction of the integers in the interval [0, g (a1, a2,..., @a)| 
is representable, for typical large aj, a2,...,aq. It is conjectured that this 


fraction is asymptotically equal to L, (Theorem 1.3 implies that this con- 
jecture is true in the case d = 2.) 


1.39. Study vector generalizations of the Frobenius problem [155, 164]. 


1.40. There are several special cases of A = {a1,@2,...,@a} for which the 
Frobenius problem is solved, for example, arithmetic sequences [153, Chap- 
ter 3]. Study these special cases in light of the generating function r(x), defined 
in the Notes and in Exercise 1.34. 


1.41. Study the generalized Frobenius number gẹ (defined in Exercise 1.25), 
e.g., in light of the Morales-Denham theorem mentioned in the Notes. Derive 
formulas for special cases, e.g., arithmetic sequences. 


1.42. For which 0 < n < b— 1 is sn (1, @2,...,aa;b) = 0? 
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A Gallery of Discrete Volumes 


Few things are harder to put up with than a good example. 


Mark Twain (1835-1910) 


A unifying theme of this book is the study of the number of integer points 
in polytopes, where the polytopes lives in a real Euclidean space R?. The 
integer points Zt form a lattice in R?, and we often call the integer points lat- 
tice points. This chapter carries us through concrete instances of lattice-point 
enumeration in various integral and rational polytopes. There is a tremendous 
amount of research taking place along these lines, even as the reader is looking 
at these pages. 


2.1 The Language of Polytopes 


A polytope in dimension 1 is a closed interval; the number of integer points in 
[¢, S] is easily seen to be | $| —|%*]| (Exercise 2.1). A 2-dimensional convex 
polytope is a convex polygon: a compact convex subset of R? bounded by 
a simple, closed curve that is made up of finitely many line segments. 

In general dimension d, a convex polytope is the convex hull of finitely 
many points in Rê. To be precise, given any finite point set {v1, V2,..., Vn} C 
R?, the polytope P is the smallest convex set containing those points; that is, 


P = {Aivi tAgvea +++ + AnVn : all Ay > 0 and Ay + Ag +- + An = 1}. 


This definition is called the vertex description of P, and we use the notation 


P = conv {v1,V2,.--,Vn}, 


the convex hull of v1, v2,...,Vn. In particular, a polytope is a closed sub- 
set of R?. Many polytopes we will study, however, are not defined this way, 
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but rather as bounded intersections of finitely many half-spaces and hyper- 
planes. One example is the polytope P defined by (1.4) in Chapter 1. This 
hyperplane description of a polytope is, in fact, equivalent to the vertex 
description. The fact that every polytope has both a vertex and a hyperplane 
description is highly nontrivial, both algorithmically and conceptually. We 
carefully work out a proof in Appendix A. 

The dimension of a polytope P is the dimension of the affine space 


span P := {x + \(y—x): xy EP, AER} 


spanned by P. If P has dimension d, we use the notation dim P = d and call 
P ad-polytope. Note that P C R? does not necessarily have dimension d. For 
example, the polytope P defined by (1.4) has dimension d — 1. 

Given a convex polytope P C R, we say that the hyperplane H = 
{x €Rt:a-x= b} is a supporting hyperplane of P if P lies entirely on 
one side of H, that is, P C {xE R: a-x < bh or Pc {xe Rt: a-x>)}. 
A face of P is a set of the form P N H, where H is a supporting hyperplane 
of P. Note that P itself is a face of P, corresponding to the degenerate hyper- 
plane R¢,! and the empty set Ø is a face of P, corresponding to a hyperplane 
that does not meet P. The (d — 1)-dimensional faces are called facets, the 1- 
dimensional faces edges, and the 0-dimensional faces vertices of P. Vertices 
are the “extreme points” of a polytope. 

A convex d-polytope has at least d+ 1 vertices. A convex d-polytope with 
exactly d+ 1 vertices is called a d-simplex. Every 1-dimensional convex poly- 
tope is a l-simplex, namely, a line segment. The 2-dimensional simplices are 
the triangles, the 3-dimensional simplices the tetrahedra. 

A convex polytope P is called integral if all of its vertices have inte- 
ger coordinates, and P is called rational if all of its vertices have rational 
coordinates. 


2.2 The Unit Cube 














As a warm-up example, we begin with the unit d-cube O := [0,1]¢, which 
simultaneously offers simple geometry and an endless fountain of research 
questions. The vertex description of O is given by the set of 2¢ vertices 
{(a1,@2,...,@q): all x, =0 or 1}. The hyperplane description is 























= { (£1, £2,..., £4) E R? : OX £p < 1 for all k = 1,2,...,d}. 





Thus, there are the 2d bounding hyperplanes xı = 0, zı = 1, z2 = 0, T2 = 
1, e; La = 0, ta=l; 


1 In the remainder of the book, we will reserve the term hyperplane for non- 
degenerate hyperplanes, i.e., sets of the form {x ERt:a-x= b}, where not all 
of the entries of a are zero. 
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We now compute the discrete volume of any integer dilate of O. That is, 
we seek the number of integer points t ON Zê for all t € Zyo. Here tP denotes 
the dilated polytope 

















{(ta,,tao,..., tza): (£1, £2,..., £4) E P}, 














for any polytope P. What is the discrete volume of O? We dilate by the 
positive integer t, as depicted in Figure 2.1, and count: 














# (tOn Z?) = #((0, tN Z?) = (t +1). 














eo o oe o o oo Ž— 
6 v1 











Fig. 2.1. The sixth dilate of O in dimension 2. 





We generally denote the lattice-point enumerator for the tt? dilates of 
P C R? by 
Lp(t) := #(tP NZ‘), 
a useful object that we also call the discrete volume of P. We may also 
think of leaving P fixed and shrinking the integer lattice: 


Lp(t) = # (Paiz). 


With this convention, Lo(t) = (t+1)%, a polynomial in the integer variable t. 
Notice that the coefficients of this polynomial are the binomial coefficients 
(2), defined through 


(e) £ m(m — 1)(m — 2)---(m—n +1) (2.1) 


n n! 





for m € C, n € Zyo. 
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What about the interior O° of the cube? The number of interior integer 
points in ¢ O° is 




















Loo(t) = # (O° NZ?) = # ((0,t)?NZ%) = t- 1)”. 











Notice that this polynomial equals (—1)?Lo(—t), the evaluation of the poly- 
nomial Lo(t) at negative integers, up to a sign. 

We now introduce another important tool for analyzing any polytope P, 
namely the generating function of Lp: 


Ehrp(z) := 1 + 5 Lp(t)z*. 
t>1 


This generating function is also called the Ehrhart series of P. 
In our case, the Ehrhart series of P = O takes on a special form. To 
illustrate, we define the Eulerian number A (d, k) through 














Y jii = Zizo A (d, k) 2 l (2.2) 


a ee 


It is not hard to prove that the polynomial oer A (d, k) z is the numerator 
of the rational function 


aN? 1) d df 1 
FE l-z = i Pr l1—=zj` 
==> 


d times 








The Eulerian numbers have many fascinating properties, including 


A(d,k) = A(d,d+1—h), 
A(d,k) =(d—k+1)A(d—1,k—-1)+kA(d—-1,k), 


d 
S°A(d,k) = dl, (2.3) 
k=0 
see afer s 
Ak) = Yay ( tye 


d= 0: 

d=1: 01 
d=2: 011 

d=3: 014 1 

d=4: 0111 11 1 

d=5: 0126 66 26 1 
d=6: 0157 302 302 571. 
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(See also [165, Sequence A008292].) 
With this definition, we can now express the Ehrhart series of O in terms 
of Eulerian numbers: 


Ehro(z) =1+ $ ¢+1) z =X +1) = See 


t>1 t>0 t>1 














Ss Ae 
~ (1=2z)4 





To summarize, we have proved the following theorem. 











Theorem 2.1. Let O be the unit d-cube. 

















(a) The lattice-point enumerator of O is the polynomial 


d 


haett ey e 


k=0 
(b) Its evaluation at negative integers yields the relation 


(-1)4La(—t) = Lo (t). 


























d k—1 
(c) The Ehrhart series of O is Ehro(z) = Bi E. 


2.3 The Standard Simplex 


The standard simplex A in dimension d is the convex hull of the d + 1 
points €1, €2,.. ., €q and the origin; here e; is the unit vector (0,...,1,...,0), 
with a 1 in the j*" position. Figure 2.2 shows A for d = 3. On the other hand, 
A can also be realized by its hyperplane description, namely 


A = {(x1,02...,%a) € R*: ay +22 +: + £a <1 and all £p > 0}. 
In the case of the standard simplex, the dilate tA is now given by 
tA = { (£1, £2,..., £a) ER? : a +z +--+ zra <t and all £k >0}. 


To compute the discrete volume of A, we would like to use the methods devel- 
oped in Chapter 1, but there’s an extra twist. The counting functions in Chap- 
ter 1 were defined by equalities, whereas the standard simplex is defined by an 
inequality. We are trying to count all integer solutions (m1, M2, ..., Ma) € hae 
to 7 
Mı +M +°- +M <St. (2.4) 


To translate this inequality in d variables into an equality in d + 1 vari- 
ables, we introduce a slack variable Mma+ı E Z>o, which picks up the dif- 
ference between the right-hand and left-hand sides of (2.4). So the number 
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A z2 





Fig. 2.2. The standard simplex A in dimension 3. 


of solutions (m1, M2,..., Ma) € ZS to (2.4) equals the number of solutions 
(m1, M2,..., Mda41) € Zeo 
Mı +M +: + Map =t. 
Now the methods of Chapter 1 apply: 
# (tANZ*) 


= const y gut J yma)... ` gmat | zot 


mı>0 m2>0 Ma4120 


Sgi (a=) (2.5) 


In contrast with Chapter 1, we do not require a partial fraction expansion but 
simply use the binomial series 


1 d+k 
(1— 2) T D ( d ie (2.6) 
k>0 

for d > 0. The constant-term identity (2.5) requires us to find the coefficient 
of z* in the binomial series (2.6), which is (ETA Hence the discrete volume 
of A is given by La(t) = (T; a polynomial in the integer variable t of 
degree d. Incidentally, the coefficients of this polynomial function in t have an 
alternative life in traditional combinatorics: 


RDE 1)¢-* stirl(d +1, k +1)t" 
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where stirl(n, j) is the Stirling number of the first kind (see Exercise 2.11). We 
also notice that (2.6) is, by definition, the Ehrhart series of A. 

Let us repeat this computation for the interior A° of the standard d- 
simplex. Now we introduce a slack variable mg+1 > 0, so that strict inequality 
is forced: 





Lac(t) = # { (m1, m2,..., ma) € Zo : mı +m + +m <t} 


= #{(mi,m2,...,ma4i) € ZS" : mı +m +: + Map =t}. 


Lao (t) = const ( >D e) ( 5 e res DD zmMa+ı | zt 


| mı>0 m2>0 Ma+41>0 
= const (= 





re 
Pee a) oh 


= const 


aCe) 


It is a fun exercise to prove that 


(47) = (S) (2.7) 


(see Exercise 2.10). We have arrived at our destination: 


Theorem 2.2. Let A be the standard d-simplez. 


(a) The lattice-point enumerator of A is the polynomial La (t) = (E; 
(b) Its evaluation at negative integers yields (—1)4La(—t) = Lao (t). 


(c) The Ehrhart series of A is Ehra(z) 














= Cee 


2.4 The Bernoulli Polynomials as Lattice-Point 
Enumerators of Pyramids 


There is a fascinating connection between the Bernoulli polynomials and cer- 
tain pyramids over unit cubes. The Bernoulli polynomials B(x) are de- 
fined through the generating function 





= z (2.8) 
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and are ubiquitous in the study of the Riemann zeta function, among other 
objects; they are named after Jacob Bernoulli (1654-1705).? The Bernoulli 
polynomials will play a prominent role in Chapter 10 in the context of Euler- 
Maclaurin summation. The first few Bernoulli polynomials are 











Bo(x) =1, 
By(a)=a--, 
By(a) =a? — 245, 
Bs(x) = a" — 52° + 52, 

1 
B(x) = 2" — 22° + ° — 5, 
B;(x) = «° oat 4 ; $ am, 
Be(x) = zê — 32° 4 oat Lg? H a 
Br(x) = x" Zas l a8 | latis 


The Bernoulli numbers are B;, := B;,(0) (see also [165, Sequences A000367 
& A002445]) and have the generating function 


By 
= => a 
k>0 


Lemma 2.3. For integers d > 1 and n > 2, 


n-1 


eS I (Ba(n) — Ba). 


k=0 


Proof. We play with the generating function of Bal) Ba. 


Ba(n)— B e” nl kz) 
yy Fae Fe a = =y ms 5 E 


d>0 f k=0 j>0 


ahe e 


j20 jl 























Now compare coefficients on both sides. 





? For more information about Bernoulli, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Bernoulli_Jacob.html. 
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Consider a (d — 1)-dimensional unit cube embedded into R? and form 
a d-dimensional pyramid by adjoining one more vertex at (0,0,...,0,1), as 
depicted in Figure 2.3. More precisely, this geometric object has the following 
hyperplane description: 


P= { (£1, £2,..., £a) ER? : 0< z1, £2,...,£4-1 Ll- z4 < 1}. (2.9) 


By definition, P is contained in the unit d-cube; in fact, its vertices are a 
subset of the vertices of the d-cube. 


T3 


--------> 


T2 # 


Fig. 2.3. The pyramid P in dimension 3. 


We now count lattice points in integer dilates of P. This number equals 
#{(m, mM2,..., ma) EZI: 0< me <t- ma <t for all k = 1,2,...,d— 1}. 


In this case we just count the solutions to 0 < Mmk < t— Ma < t directly: once 
we pick the integer ma (between 0 and t), we have t — ma + 1 independent 


choices for each of the integers m1, M2, ..., Ma—-1. Hence 
t t+1 
Lp(t)= >> (t-—ma+1)* aoe kt) = = (Balt +2)— Ba), (2.10) 
mia=0 


by Lemma 2.3. This is, naturally, a polynomial in t. 
We now turn our attention to the number of interior lattice points in P: 


- a, O0< me <t—mg<t 
Lepa(t) =# (mmama) € 2 ' forall k =1,2,...,d—1 
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By a similar counting argument, 


t-1 


Lpo(t)= X (t—ma-1)* ee 1> «(Balt - 1) - Ba). 


mag=1 


Incidentally, the Bernoulli polynomials are known (Exercise 2.15) to have the 
symmetry 


Ba(1 — x) = (—1)4Ba(a). (2.11) 
This identity coupled with the fact (Exercise 2.16) that 
Ba = 0 for all odd d > 3 (2.12) 


gives the relation 


7 (Ba (1 - (t= D) = Ba) 


= (-1)* 5 (Balt ~ 1) — Ba) = (-1)* Lpo (t). 


Next we compute the Ehrhart series of P. We can actually do this in 
somewhat greater generality. Namely, for a (d — 1)-polytope Q with vertices 
V1,V2,---;Vm, define Pyr(Q), the pyramid over Q, as the convex hull of 
(v1,0),(v2,0),..., (Vm, 0), (0,...,0,1). In our example above, the d-polytope 
P is equal to Pyr(Q) for the unit (d—1)-cube O. The number of integer points 
in t Pyr(Q) is, by construction, 








Lp(-t) = 5 (Ba(-t +2) — Ba) = 





























Loyo) (t) = 1+ Lo(1) + Lo(2) +--+ + Lolt = 14D told 


because in ¢t Pyr(Q), there is one lattice point with see ke t, Loll) 
lattice points with xa-coordinate t— 1, Lo (2) lattice points with za-coordinate 
t—2, etc., up to Lo (t) lattice points with za = 0. Figure 2.4 shows the instance 
t = 3 for a pyramid over a square. 

This identity for Lpyr(o)(t) allows us to compute the Ehrhart series of 
Pyr(Q) from the Ehrhart series of Q: 
Theorem 2.4. Ehrpyr(o) (2) = ea ; 


Proof. 


Ehrpyr(o)(2) = 1+ Leyo t) 51) + Dbl) zt 


t>1 t>1 


ais DE 


t>0 t>1 j=1 j>1 t>j 




















jj 1+5 Lo(j) 24 
= +)" Loli) = yet o(j) , 


fs l-z 
jol 
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tas 


T2 » 


Fig. 2.4. Counting the lattice points in t Pyr(Q). 
Our pyramid P that started this section is a pyramid over the unit (d—1)- 
cube, and so 


1 EZ] A(d-1,k)z-! Y A(d-1,k) z"! 
l-z (=z) z (1— z)! 





Ehrp(z) = (2.13) 


Incidentally, this Ehrhart series gives rise to a generating function (different 
from (2.8)) for the Bernoulli polynomial Ba (see Exercise 2.22). 
Let’s summarize what we have proved for the pyramid over the unit cube. 


Theorem 2.5. Let P be the d-pyramid 
P = {(2x1,%2,...,%4) ER? : 0 < z1, £2,...,£d—1 Í 1 — Ta < 1}. 


(a) The lattice-point enumerator of P is the polynomial 
1 
Lp(t) = P (Ba(t + 2) — Ba). 


(b) Its evaluation at negative integers yields (—1)*Lp(—t) = Lpo(t). 
(c) The Ehrhart series of P is Ehrp(z) = ear Ald- Lk) 


O-z 

















Patterns are emerging... 
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2.5 The Lattice-Point Enumerators 
of the Cross-Polytopes 


Consider the cross-polytope © in R given by the hyperplane description 
© := { (£1, £2,... £4) E R? : lay] +|xo| +--+ + lza] <1}. (2.14) 


Figure 2.5 shows the 3-dimensional instance of ©, an octahedron. The vertices 
of © are (+1,0,...,0),(0,41,0,...,0),...,(0,...,0,4£1). 














Fig. 2.5. The cross-polytope © in dimension 3. 


To compute the discrete volume of ©, we use a process similar to that of 
Section 2.4. Namely, for a (d — 1)-polytope Q with vertices v1, V2,..., Vm, 
define BiPyr(Q), the bipyramid over Q, as the convex hull of 


(v1,0),(v2,0),-.-,(Vm,0), (0,2. 0,1), and (Ope ag Opa x 


In our example above, the d-dimensional cross-polytope is the bipyramid 
over the (d — 1)-dimensional cross-polytope. The number of integer points in 
t BiPyr(Q) is, by construction, 


LBiPyr(Q) (t) = 2+ ete + 2L9(2) bet tg 2Lo(t 1) t Lo(t) 








=2429 bal )+ Lolt). 


This identity allows us to compute the Ehrhart series of BiPyr(Q) from the 
Ehrhart series of Q, in a manner similar to the proof of Theorem 2.4. We leave 
the proof of the following result as Exercise 2.23. 
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Theorem 2.6. If Q contains the origin, then Ehrgipyr(g)(Z) = itz Ehro(z). 














This theorem allows us to compute the Ehrhart series of © effortlessly: 
The cross-polytope © in dimension 0 is the origin, with Ehrhart series i. 
The higher-dimensional cross-polytopes can be computed recursively through 
Theorem 2.6 as 





(1+2)? 
(1—2) ` 
Since Ehro(z) = 1 + 32,5, Lo(t)2*, we can retrieve Lo(t) by expanding 
Ehro(z) into its power series at z = 0: 


(Tea) pe (2 


G-a 2H 
d 


a oe P 


Ehro(z) = 





Ehro(z) = 





t>0 0 


See 


In the last step we used the fact that (arre) = 0 for 0 < t < k. But then 


T : -DS (i ED 


t>1 t>0 k=0 


and hence Lo (t) = ues (2) (a Fao for allt > 1. 
We finish this section by counting the interior lattice points in tO. We 
start by noticing, since t is an integer, that 














Loo (t) = # {(mi, ma,...,ma) E Z*: [mı] [mol ii [mal <t} 
= #{(m, m2... ma) € Z4 : |my| + ma] ++ mal <t- 1} 
= Lo(t- 1). 
On the other hand, we can use (2.7): 
d 





T l 


> 
ll 
° 


Il 
Mea 


EOC 
ewe) 
Caney 


=0 
= (-1)*Lo(t- 


> 
ll 
° 
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Comparing the last two computations, we see that (—1)4Lo(—t) = Loe(t). 
Let us summarize: 


Theorem 2.7. Let © be the cross-polytope in R°. 
a) The lattice-point enumerator of © is the polynomial 
(a) p poly 
d 
d\ (t—k+d 
Lo(t) = ; 
=a) 
k=0 
(b) Its evaluation at negative integers yields (—1)4Lo(—t) = Loo(t). 
d 

(c) The Ehrhart series of P is Bhro(z) = +2 


= G2): 

















2.6 Pick’s Theorem 


Going back to basic concepts, we now give a complete account of Lp for all 
integral convex polygons P in R?. Denote the number of integer points inside 
the polygon P by J, and the number of integer points on the boundary of P by 
B. The following result, called Pick’s theorem in honor of its discoverer Georg 
Alexander Pick (1859-1942), presents the astonishing fact that the area A of 
P can be computed simply by counting lattice points: 


Theorem 2.8 (Pick’s theorem). For an integral convex polygon, 
1 
A=I+=B-1. 
2 
Proof. We start by proving that Pick’s identity has an additive character: we 


can decompose P into the union of two integral polygons Pı and P2 by joining 
two vertices of P with a line segment, as shown in Figure 2.6. 


Fig. 2.6. Decomposition of a polygon into two. 


We claim that the validity of Pick’s identity for P follows from the validity 
of Pick’s identity for Pı and P2. Denote the area, number of interior lattice 


2.6 Pick’s Theorem 39 


points, and number of boundary lattice points of Py, by Ax, Ik, and By, 
respectively, for k = 1,2. Clearly, 


A=A,+ 42. 


Furthermore, if we denote the number of lattice points on the edge common 
to Pı and P2 by L, then 


P=h+h+Ll-2 and B = Bı + B2 — 2L +2. 











Thus 
1 1 1 
[+5B-1l=htht+l 2+ 5Bit 5 Be E+1-1 
=I lp 14+ -Íp dl 
=i at 2 +322 . 


This proves the claim. Note that our proof also shows that the validity of 
Pick’s identity for P4 follows from the validity of Pick’s identity for P and P2. 





Fig. 2.7. Triangulation of a polygon. 


Now, any convex polygon can be decomposed into triangles that share a 
common vertex, as illustrated in Figure 2.7. Hence it suffices to prove Pick’s 
theorem for triangles. Further simplifying the picture, we can embed any 
integral triangle into an integral rectangle as suggested by Figure 2.8. 

















Fig. 2.8. Embedding a triangle in a rectangle. 
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This reduces the proof of Pick’s theorem to proving the theorem for integral 
rectangles whose edges are parallel to the coordinate axes, and for rectangular 
triangles two of whose edges are parallel to the coordinate axes. These two 
cases are left to the reader as Exercise 2.24. 














Pick’s theorem allows us not only to count the lattice points strictly inside 
the polygon P but also the total number of lattice points contained in P, 
because this number is 


1 1 
I+B=A-SB+1+B=A+5B+1. (2.15) 


From this identity, it is now easy to describe the lattice-point enumerator Lp: 


Theorem 2.9. Suppose P is an integral conver polygon with area A and B 
lattice points on its boundary. 


(a) The lattice-point enumerator of P is the polynomial 
Lp(t) = At? + 5 Bit i 
(b) Its evaluation at negative integers yields the relation 
Lp(—t) = Lpo (t). 
(c) The Ehrhart series of P is 


(A-2 +1)z2+(4+2-2)z+1 
(1-2)? l 





Ehrp(z) = 


Note that in the numerator of the Ehrhart series, the coefficient of z? is Lpo (1), 
and the coefficient of z is Lp(1) — 3. 


Proof. Statement (a) follows from (2.15) if we can prove that the area of tP 
is At?, and the number of boundary points on tP is Bt, which is the content 
of Exercise 2.25. Statement (b) follows with Lp. (t) = Lp(t) — Bt. Finally, the 
Ehrhart series is 


Ehrp(z) =1+ X Lp(t) z 











t>1 
= (4+ tti) 

t>0 
= zoe B A l 1 
(hae ' 2 (1-2) rae 
(A-2 +1)? +(4+2-2)z+1 














(= 2)’ 
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2.7 Polygons with Rational Vertices 


In this section we will establish formulas for the number of integer points in 
any rational convex polygon and its integral dilates. 

A natural first step is to fix a triangulation of the polygon P, which reduces 
our problem to that of counting integer points in rational triangles. However, 
this procedure merits some remarks. After counting lattice points in the tri- 
angles, we need to put those back together to form the polygon. But then we 
need to take care of the overcounting on line segments (where the triangles 
meet). Computing the number of lattice points on rational line segments is 
considerably easier than enumerating lattice points in 2-dimensional regions; 
however, it is still nontrivial (see Popoviciu’s Theorem 1.5). 

After triangulating P, we can further simplify the picture by embedding an 
arbitrary rational triangle in a rational rectangle as in Figure 2.8. To compute 
lattice points in a triangle, we can first count the points in a rectangle with 
edges parallel to the coordinate axes, and then subtract the number of points 
in three right triangles, each with two edges are parallel to the axes, and 
possibly another rectangle, as shown in Figure 2.8. Since rectangles are easy 
to deal with (see Exercise 2.2), the problem reduces to finding a formula for 
a right triangle two of whose edges are parallel to the coordinate axes. 

We now adjust and expand our generating-function machinery to these 
right triangles. Such a triangle 7 is a subset of R? consisting of all points 
(x,y) satisfying 


b 
Lz Y2 g eet+fy sr 


a 
d 
for some integers a,b, d,e, f,r (with ea+ fb < rd; otherwise, the triangle would 
be empty). Because the lattice point count is invariant under horizontal and 
vertical integer translations and under flipping about the x- or y-axis, we may 
assume that a,b,d,e,f,r > 0 and a,b < d. (One should meditate about this 
fact for a minute.) Thus we arrive at the triangle 7 depicted in Figure 2.9. 
To make our life a little easier, let’s assume for the moment that e and f 
are relatively prime; we will deal with the general case in the exercises. So let 





b 
T={(eyeR s>, v2 5, ect fysrh. (2.16) 
To derive a formula for 
2 ta tb 
Lr(t)=#4 (m,n) EZ :m> g PZ g UT LEAT 


we want to use methods similar to those in Chapter 1. As in Section 2.3, we 
introduce a slack variable s: 


Ire =#{ (mn) eZ: m> n> Ë, em + faste) 








= #{ (mns) eZ: mea, iat, s>0, em fnts=trh. 
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4 (G74) 





(23) Z. 





8y 


Fig. 2.9. A right rational triangle. 


This counting function can now, as earlier, be interpreted as the coefficient of 


zt" in the function 


em} foe] (ee 


m> te n> t s>0 


Here the subscript (e.g., m > t4) under a summation sign means “sum over 
all integers satisfying this condition.” For example, in the first sum we start 
with the least integer greater than or equal to ta, which is denoted by [t] 
(and is equal to | “>+| + 1 by Exercise 1.4 (j)). Hence the above generating 
function can be rewritten as 





em n s [ ‘tle [aly 1 
Dd, Z 3 2 27 =i pi l1-z 


mf J emn 7 \e20 





guru 
-“Gd-wa-aAa-y 2) 








where we have introduced, for ease of notation, 
t tb 
U i= $ e and y= H f. (2.18) 


To extract the coefficient of 2*" of our generating function (2.17), we use 
familiar methods. As usual, we shift this coefficient to a constant term: 
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guru—tr 

Lr(t) = t 

rO = const (r A) 


= const (a ze) (1 a (1 a=] ; 


Before we apply the partial fraction machinery to this function, we should 
make sure that it is indeed a proper rational function, that is, that the total 
degree satisfies 

















u+v-tr—e-f-1<0 (2.19) 


(see Exercise 2.31). Then we expand into partial fractions (here we’re using 
our assumption that e and f do not have any common factors!): 


1 














A Bj Zg <" D 
aoe j a DD A 5 SE: (2.20) 


As numerous times before, the coefficients D; do not contribute to the con- 
stant term, so that we obtain 





B 
Lr(t) = j a Ci +C- 03. (2.21) 
j f 


We invite the reader to compute the coefficients appearing in this formula 
(Exercise 2.32): 











gitt) 
Aj = a —, 
e(1- 2) a-g) 
l(u—tr+1) 
Bı = g 


f(i-er) a—e) 
eee (u+v-—tr)? ; a 1 ga i +) : G 1 1) 
e 











2ef 2 ef e'f) 4 f 
n (5 ! 5 ! f), (2.22) 
"E u | 1 | L 
G=-5. 


Putting these ingredients into (2.21) yields the following formula for our 
lattice-point count. 
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Theorem 2.10. For the rectangular rational triangle T given by (2.16), where 
e and f are relatively prime, 





























Lr) = zr u v)? +i (ir u JE H 5) 
ly E E OS aca oes a 
Ge at a 
{ot giet) 15 1 TA tr) 
PEE E 


This identity can be rephrased in terms of the Fourier-Dedekind sum that 
we introduced in (1.13): 


eS Cree oa y Erea 
ltt Pate es 


+ So- f, 1; e) $ Su—tr(e, 1; f) $ 








The general formula for Ly—not assuming that e and f are relatively 
prime—is the content of Exercise 2.34. 

Let us pause for a moment and study the nature of Ly as a function 
of t. Aside from the last two finite sums (which will be put in the spotlight 
in Chapter 8) and the appearance of u and v, Ly is a quadratic polynomial 
in t. And in those two sums, t appears only in the exponent of roots of unity, 
namely as the exponent of € and £f. As a function of t, € is periodic with 
period e, and similarly bt is periodic with period f. We should also remember 
that u and v are functions of t; but they can be easily written in terms of 
the fractional-part function, which again gives rise to periodic functions in t. 
So Ly(t) is a (quadratic) “polynomial” in t, whose coefficients are periodic 
functions in t. This is reminiscent of the counting functions of Chapter 1, which 
showed a similar periodic-polynomial behavior. Inspired by both examples, we 
define a quasipolynomial Q as an expression of the form Q(t) = c(t) t” + 

-- + c(t) t+ co(t), where co,...,Cn are periodic functions in t. The degree 
of Q is n, and the least common period of cp,...,Cn is the period of Q. 
Alternatively, for a quasipolynomial Q, there exist a positive integer k and 
polynomials po, pi,.--,Px—1 Such that 


Q(t) = po(t) if t = 0 mod k, 
Q(t) = pı (t) ift = 1 mod k, 


Q(t) eee ift =k—1 mod k. 


3 Here we tacitly assume that cn is not the zero function. 
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The minimal such k is the period of Q, and for this minimal k, the polynomials 
Po, P1, ---,Pk—1 are the constituents of Q. 

By the triangulation and embedding-in-a-box arguments that started this 
section, we can now state a general structural result for rational polygons. 


Theorem 2.11. Let P be any rational polygon. Then Lp(t) is a quasipolyno- 
mial of degree 2. Its leading coefficient is the area of P (in particular, it is a 
constant). 


We have the technology at this point to also study the period of Lp; we 
let the reader enjoy the ensuing details (see Exercise 2.35). 


Proof. By Exercises 2.2 and 2.34 (the general form of Theorem 2.10), the the- 
orem holds for rational rectangles and right triangles whose edges are parallel 
to the axes. Now use the additivity of both degree-2 quasipolynomials and 
areas, and Popoviciu’s theorem (Theorem 1.5). 














2.8 Euler’s Generating Function for General Rational 
Polytopes 


By now we have computed several instances of counting functions by set- 
ting up a generating function that fits the particular problem we’re interested 
in. In this section, we set up such a generating function for the lattice-point 
enumerator of any rational polytope. Such a polytope is given by its hyper- 
plane description as an intersection of half-spaces and hyperplanes. The half- 
spaces are algebraically given by linear inequalities, the hyperplanes by linear 
equations. If the polytope is rational, we can choose the coefficients of these 
inequalities and equations to be integers (Exercise 2.7). To unify both de- 
scriptions, we can introduce slack variables to turn the half-space inequalities 
into equalities. Furthermore, by translating our polytope into the nonnegative 
orthant (we can always shift a polytope by an integer vector without changing 
the lattice-point count), we may assume that all points in the polytope have 
nonnegative coordinates. In summary, after a harmless integer translation, we 
can describe any rational polytope P as 


P= {xE RZ: Ax=b} (2.23) 


for some integral matrix A € Z™*4 and some integer vector b € Z™. (Note 


that d is not necessarily the dimension of P.) To describe the tt dilate of P, 
we simply scale a point x € P by +, or alternatively, multiply b by t: 


tP = {xe RS): AŽ =b} = {xe Rd, : Ax=tb}. 
Hence the lattice-point enumerator of P is the counting function 


Lp(t) = # {x € Zlo : Ax = tb}. (2.24) 
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Example 2.12. Suppose P is the quadrilateral with vertices (0,0), (2,0), 
(1,1), and (0, 3): 











The half-space-inequality description of P is 


o 


= 2, 
P= { (cite) ER : £1, £2 > 0, ay +29 <2 


Lp(t) = # { (ous) EZ: 21,22 > 0, 7! +222 < a 


Ly + £o < 2t 


4. 
= # [leraz as 2a) E€ Z? : £1, £2, £3, £4 > 0, 


1210 3 
=#fxe Zh: Ge 


Using the ideas from Sections 1.3, 1.5, 2.3, and 2.7, we now construct a gen- 
erating function for this counting function. In these previous sections, the 
lattice-point enumerator could be described with only one nontrivial linear 
equation, whereas now we have a system of such linear constraints. However, 
we can use the same approach of encoding the linear equation into geometric 
series; we just need more than one variable. When we expand the function 


1 
(1 — 2122) (1 — 2?z2) (1 — 21) (1 — 22) 23t 22t 


£1 + 202+ £3 = 3t, 
£1 + £2 + T4 = 2t 





f (21,22) := 


into geometric series, we have 


f (21,22) = ( Sy aa)” ) ( >, aa" ) ( eS a) ( x a) a 


nı>0 n2>0 n3>0 n4>0 
= 5 zy +2n2+n3 ae tn2+na 2t 





When we compute the constant term (in both z1 and z2), we are counting 
solutions (n1, n2, ng, n4) € ZŻg to 
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nı 
1210 no | /3t 
1101 nz} \2t)}? 
n4 
that is, the constant term of f (21, z2) counts the integer points in P: 


1 


Lp(t) = : 
Pont (1 — 2122) (1 — 2229) (1 — 21) (1 — 22) 23t 23? 





We invite the reader to actually compute this constant term (Exercise 2.36). 
It turns out to be 




















7 5 7+(-1) 
Peg 
4 2 8 
Returning to the general case of a polytope P given by (2.23), we denote 
8 g polytop g y , 
the columns of A by c1,¢€2,...,¢g. Let Z = (21, Z2,..-.,2%m) and expand the 
function 1 
2.25 
(1 — z°1) (1 — ze2)--- (1 — zea) ztb ( ) 
in terms of geometric series: 
n120 n220 na>O0 
Here we use the abbreviating notation z° := z[125?---z@ for the vectors 


Z = (21, 22,---,2m) E€ C™ and c = (c1,¢2,...,¢m) € Z™. When multiplying 
out everything, a typical term will look like 





{C1 + N2C2 ++: + Naca — tb = An — tb, 


where n = (n1,n2,..., Nna) € Z2. That is, if we take the constant term of our 
generating function (2.25), we’re counting integer vectors n € Z{ g satisfying 


An-—tb=0, that is, An= tb. 
So this constant term will pick up exactly the number of lattice points n € Ze 
in tP: E 


Theorem 2.13 (Euler’s generating function). Suppose the rational poly- 
tope P is given by (2.23). Then the Ehrhart quasipolynomial of P can be 
computed as follows: 


1 
(1—z°:1)(1 ay ye) : 


We finish this section with rephrasing this constant-term identity in terms 
of Ehrhart series. 

















Lp(t) = const ( 
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Corollary 2.14. Suppose the rational polytope P is given by (2.23). Then the 
Ehrhart series of P can be computed as 





1 
REE (orar G- a | 


Proof. By Theorem 2.13, 


Ehrp (x) = com (z Zz) (1 z2)... (1 a} qt 








i 1 at 
= cons _ 7c eg e a E _ 7c tb 
(1 — 2%) (1 — ze2)--- (1 — 2%) Sz 

1 1 

















Notes 


1. Convex polytopes are beautiful objects with a rich history and interest- 
ing theory, which we have only glimpsed here. For a good introduction to 
polytopes, we recommend [47, 90, 193]. Polytopes appear in a vast range 
of current research areas, including Grobner bases and commutative algebra 
[174], combinatorial optimization [159], integral geometry [110], and geometry 
of numbers [163]. 


2. The distinction between the vertex and hyperplane description of a convex 
polytope leads to an interesting algorithmic question; namely, how quickly can 
we retrieve the first piece of data from the second and vice versa [159, 193]? 


3. Ehrhart series are named after Eugène Ehrhart (1906-2000),* in antici- 
pation of the theorems we will prove in Chapter 3. The Ehrhart series of a 
polytope belonging to the special class of normal polytopes equals another 
rational generating function, the Hilbert—Poincaré series. These series appear 
in the study of graded algebras (see, for example, [96, 171]). Ehrhart series also 
appear in the context of toric varieties, a vast and fruitful subject [64, 84]. 


4. The Eulerian numbers A(d,k) are named after Leonhard Euler (1707— 
1783)° and arise naturally in the statistics of permutations: A (d, k) counts 
permutations of {1,2,...,d} with & —1 ascents. For more on A (d, k), see [62, 
Section 6.5]. 


t For more information about Ehrhart, see 
http://icps.u-strasbg.fr/~clauss/Ehrhart .html. 

5 For more information about Euler, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Euler.html. 
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5. The pyramids of Section 2.4 have an interpretation as order polytopes [172]. 
A curious fact about the lattice-point enumerators of these pyramids is that 
they have arbitrarily large real roots as the dimension grows [24]. 


6. The counting function Lo for the cross-polytope can, incidentally, also be 


written as 
min(d,t) 


> (e) 


In particular, Lo is symmetric in d and t. The cross-polytope counting func- 
tions bear a connection to Laguerre polynomials, the d-dimensional harmonic 
oscillator, and the Riemann hypothesis. This connection appeared in [51], 
where Daniel Bump, Kwok-Kwong Choi, Par Kurlberg, and Jeffrey Vaaler 
also found a curious fact about the roots of the polynomials Lo: they all have 
real part —} (an instance of a local Riemann hypothesis). This fact was proved 
independently by Peter Kirschenhofer, Attila Pethő, and Robert Tichy [109]; 
see also the Notes in Chapter 4. 


7. Theorem 2.8 marks the beginning of the general study of lattice-point 
enumeration in polytopes. Its amazingly simple statement was discovered by 
Georg Alexander Pick (1859-1942)®° in 1899 [143]. Pick’s theorem holds also 
for a nonconvex polygon, provided its boundary forms a simple curve. In Chap- 
ter 12, we prove a generalization of Pick’s theorem that includes nonconvex 
curves. 


8. The results of Section 2.7 appeared in [29]. We will see in Chapter 8 that 
the finite sums over roots of unity can be rephrased in terms of Dedekind- 
Rademacher sums, which—as we will see in Chapter 8—can be computed very 
quickly. The theorems of Section 2.7 will then imply that the discrete volume 
of any rational polygon can be computed efficiently. 


9. If we replace tb in (2.24) by a variable integer vector v, the counting 
function 


f(v) =# {xe ZS,: Ax=v} 


is called a vector partition function: it counts partitions of the vector v in terms 
of the columns of A. Vector partition functions are the multivariate analogues 
of our lattice-point enumerators Lp (t), have many interesting properties, and 
give rise to intriguing open questions [20, 40, 63, 173, 178]. 


10. While Leonhard Euler most likely did not think of lattice-point enumer- 
ation in the sense of Ehrhart, we attribute Theorem 2.13 to him, since he 
certainly worked with generating functions of this type, probably thinking of 


6 For more information about Pick, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Pick.html. 
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them as vector partition functions. The potential of Euler’s generating func- 
tion for Ehrhart polynomials was already realized by Ehrhart [79, 81]. Several 
modern approaches to computing Ehrhart polynomials are based on Theo- 
rem 2.13 (see, for example, [19, 46, 119]). 


Exercises 


2.1. & Fix positive integers a,b,c,d such that a/b < c/d, and let P be the 


interval ($, s] (so P is a 1-dimensional rational convex polytope). Compute 


Lp(t) = # (tP NA Z) and Lpo(t) and show directly that Lp (t) and Lpo (t) are 
quasipolynomials with period lcm(b, d) that satisfy 
Lpo(—t) = —Lp(t). 
(Hint: Exercise 1.4 (i).) 
2.2. & Fix positive rational numbers aj, b1,a2,b2 and let R be the rectan- 


gle with vertices (a1, 61), (a2, 61), (a@2,b2), and (a1, b2). Compute Lr(t) and 
Ehrp(z). 


2.3. Fix positive integers a and b, and let T be a triangle with vertices (0,0), 
,0), and (0, b). 


(a 

(a) Compute Lz(t) and Ehrz (2). 

(b) Use (a) to derive the following formula for the greatest common divisor of 
a and b: 


b—1 
ka 
d =2 — b—ab. 
gcd(a, b) H t+ a 


(Hint: Exercise 1.12.) 
2.4. Prove that for two polytopes P C R™ and Q C R”, 

#((P x Q)NZ™*") = #(PNZ™)-#(QNZ"). 
Hence, Lpxo(t) = Lp(t) Lo(t). 


2.5. Prove that if F is a face of P and G is a face of F, then G is also a face 
of P. (That is, the face relation is transitive.) 


2.6. & Suppose A is a d-simplex with vertices V = {v1, V2,..-,Va41}. Prove 
that for any nonempty subset W C V, conv W is a face of A, and conversely, 
that any face of A is of the form convW for some W C V. Conclude the 
following corollaries from this characterization of the faces of a simplex: 


(a) A face of any simplex is again a simplex. 
(b) The intersection of two faces of a simplex A is again a face of A. 
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2.7. œ Prove that a rational convex polytope can be described by a system 
of linear inequalities and equations with integral coefficients. 


2.8. & Prove the properties (2.3) of the Eulerian numbers for all integers 
1<k<d, namely: 

(a) A(d,k) = A(d,d+1—-—k&); 

(b) A(d,k) = (d-—k+1)A(d—1,k-—1)+kA(d—-1,k); 





k 
waan- Dn (t Ne- ya. 


2.9. de Prove (2.6); namely, for d > 0, TT = Vero Cy) a, 


2.10. & Prove (2.7): For t,k € Z and d € Zso, 


—t+k t+d—1-k 
—1)¢ = : 
Pace ) 
2.11. The Stirling numbers of the first kind, stirl(n,m), are defined through 
the finite generating function 


a(a—1)---(e-—n+1) = Y stioma” 


(See also [165, Sequence A008275].) Prove that 


d 
d+t 
)a-k 
Pi JAC! stirl(d +1, k + 1)t" ={ d ), 


the lattice-point enumerator for the standard d-simplex. 


2.12. Give a direct proof that the number of solutions (m1, M2, .. . , Ma+1) € 
hang to mı +m2+-+-+mMma41 = t equals eae? (Hint: think of t objects lined 
up and separated by d walls.) 


2.13. Compute Lp(t), where P is the regular tetrahedron with vertices 
(0, 0, 0) ) (£ 1, 0) ? GR 0, 1) 7 (0, I; 1). 


2.14. & Prove that the power series 


that defines the Bernoulli numbers has radius of convergence 27. 
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2.15. & Prove (2.11); namely, By(1 — x) = (—1)4B,(z). 
2.16. & Prove (2.12); namely, B4 = 0 for all odd d > 3. 


2.17. Show that for each positive integer n, 


na”! = 3 C) Bryn (2)- 


This gives us a change of basis for the polynomials of degree < n, allowing us 
to represent any polynomial as a sum of Bernoulli polynomials. 


2.18. As a complement to the previous exercise, show that we also have a 
change of basis in the other direction. Namely, we can represent a single 
Bernoulli polynomial in terms of the monomials as follows: 


B,(t) = 2 (;) Bp °. 


2.19. Show that for all positive integers m,n, and any x € R, 


(This is a Hecke-operator-type identity, originally found by Joseph Ludwig 
Raabe in 1851.) 


2.20. Show that Bna(x +1) — B,(x) = ng™t. 


2.21. An alternative way to define the Bernoulli polynomials is to give ele- 
mentary properties that uniquely characterize them. Show that the following 
three properties uniquely determine the Bernoulli polynomials, as defined in 
the text by (2.8): 


(a) Bola) =1. 
(b) aBa (e) = n Bn-1 (x), for all n > 1. 


(c) J By, (x) dx = 0, for all n > 1. 


2.22. Prove that for d = 1, 


and for d > 2, 


aY! A(d—1,k)z**! Ba 
f k=1 ? | 
S Balt) = a_i i 





Exercises 53 
2.23. & Prove Theorem 2.6: Ehrpipy:(g)(z) = 4 Ehrg(z). 


2.24. & Let R be an integral rectangle whose edges are parallel to the coordi- 
nate axes, and let 7 be a rectangular triangle two of whose edges are parallel 
to the coordinate axes. Show that Pick’s theorem holds for R and T. 


2.25. œ Suppose P is an integral polygon with area A and B lattice points on 
its boundary. Show that the area of tP is At?, and the number of boundary 
points on tP is Bt. (Hint: Exercise 1.12.) 


2.26. Let P be the self-intersecting polygon defined by the line segments 
[(0, 0), (4,2)], [(4, 2), (4,0)], [(4,0), (0, 2)], and [(0, 2), (0,0)]. Show that Pick’s 
theorem does not hold for P. 


2.27. Suppose that P and Q are integral polygons, and that Q lies entirely 
inside P. Then the area bounded by the boundaries of P and Q, denoted 
by P — Q, is a “doubly connected polygon.” Find and prove the analogue of 
Pick’s theorem for P — Q. Generalize your formula to a polygon with n “holes” 
(instead of one). 


2.28. Consider the rhombus 
R = {(x,y) : alx| + dly| < ad}, 


where a and b are fixed positive integers. Find a formula for Lr (t). 


2.29. We define the nt” Farey sequence as all the rational numbers ş in the 
interval [0,1] when a and b are coprime and b < n. For instance, the sixth 


Farey sequence is $, 4,2,4,4,2,3,2,3,3,4 





5 
6? 5? 4? 395? 295? 3° 495° 6 1° 


(a) For two consecutive fractions ¢ and § in a Farey sequence, prove that 


be-—ad=1. 
(b) For three consecutive fractions ¢, $, and F in a Farey sequence, show that 


c _ ate 


d b+f° 





2.30. Let [x] denote the smallest integer larger than or equal to x. Prove that 
for all positive integers a and b, 


a b 


a+(—1) 5° (<1) = b NEE] mod 4. 


m=0 n=0 


(Hint: This is a variation of Exercise 1.6. One way to obtain this identity is by 
counting lattice points in a certain triangle, keeping track only of the parity.) 


2.31. & Verify (2.19). 


2.32. & Compute the partial fraction coefficients (2.22). 
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2.33. Let a,b be positive integers. Show that 








1 k — b-1 
E 7 (z= £) ‘+a + terms with positive powers of (z—€*). 


2.34. & Let T be given by (2.16), and let c = gcd(e, f). Prove that 





Lr(t) = = (r-u)? +i (tr—a D EEr) 


1 a 1 l 1 l 1 /e M f eo 1 

4 e f)'`1i2\f e ' ef 
1 1 u+v-tr z eee ai ay) 
2e DF ef me ee ef A (=E)? 
giet) 1 f- ge 


p ı (1-8) (1-8) "hie ee) 


2.35. Let P be a rational polygon, and let d be the least common multiple 
of the denominators of the vertices of P. Prove directly (using Exercise 2.34) 
that the period of Lp divides d. 




















alos. 


2.36. & Finish the calculation in Example 2.12, that is, compute 


1 
(1 — 2122) (1 — 2222) (1 — 21) (1 — 29) 23t 22t ` 





const 


2.37. Compute the vector partition function of the quadrilateral given in Ex- 
ample 2.12, that is, compute the counting function 


Hove) = #{veaty: (1242)x=(2)] 


for v1, v2 € Z. (This function depends on the relationship between vı and v2.) 


2.38. Search on the Internet for the program polymake [86]. You can download 
it for free. Experiment. 


Open Problems 





2.39. Pick d+ 1 of the 2¢ vertices of the unit d-cube O, and let A be the 
simplex defined by their convex hull. 











(a) Which choice of vertices maximizes vol A? 
(b) What is the maximum volume of such a A? 
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2.40. Find classes of integer d-polytopes (Pa) ,., for which Lp, (t) is symmet- 
ric in d and t. (The standard simplices A and the cross-polytopes © form two 
such classes. ) 


2.41. We mentioned already in the Notes that all the roots of the polynomials 
Lo have real part —} (51, 109]. Find other classes of polytopes whose lattice- 
point enumerator exhibits such a special behavior. 
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Counting Lattice Points in Polytopes: 
The Ehrhart Theory 


Ubi materia, ibi geometria. 


Johannes Kepler (1571-1630) 


Given the profusion of examples that gave rise to the polynomial behavior of 
the integer-point counting function Lp (t) for special polytopes P, we now ask 
whether there is a general structure theorem. As the ideas unfold, the reader 
is invited to look back at Chapters 1 and 2 as appetizers and indeed as special 
cases of the theorems developed below. 


3.1 Triangulations and Pointed Cones 


Because most of the proofs that follow work like a charm for a simplex, we first 
dissect a polytope into simplices. This dissection is captured by the following 
definition. 

A triangulation of a convex d-polytope P is a finite collection T of d- 
simplices with the following properties: 


e P=|]JA. 


AET 
e For any Aj, Ag E T, A, N Ag is a face common to both A; and Ag. 


We say that P can be triangulated using no new vertices if there exists 
a triangulation T such that the vertices of any A € T are vertices of P. 


Theorem 3.1 (Existence of triangulations). Every convex polytope can 
be triangulated using no new vertices. 


This theorem seems intuitively obvious but is not entirely trivial to prove. 
We carefully work out a proof in Appendix B. 
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Fig. 3.1. Two (very different) triangulations of the 3-cube. 


A pointed cone K C Rê is a set of the form 
K = {v + Wi + Agwe +--+ +AmWm: M1, A2;---;Am = OF" 


where v, W1,W2,.-.,Wm E R? are such that there exists a hyperplane H for 
which HONK = {v}; that is, K \ {v} lies strictly on one side of H. The 
vector v is called the apex of K, and the w;’s are the generators of K. 
The cone is rational if v, w1, W2,...,Wm E Q%, in which case we may choose 
W1,W2,...,Wm E Z? by clearing denominators. The dimension of K is the 
dimension of the affine space spanned by K; if K is of dimension d, we call it a 
d-cone. The d-cone K is simplicial if K has precisely d linearly independent 
generators. 

Just as polytopes have a description as an intersection of half-spaces, so 
do pointed cones: A rational pointed d-cone is the d-dimensional intersection 
of finitely many half-spaces of the form 


{xe R?: atı + agar +--+ + agra < b} 


with integral parameters aj,d2,...,@a,b € Z such that the corresponding 
hyperplanes of the form 


{xe R?: atı + ar +--+ + agra = b} 


meet in exactly one point. 

Cones are important for many reasons. The most practical for us is a 
process called coning over a polytope. Given a convex polytope P C R? with 
vertices V1, V2,..., Vn, we lift these vertices into R¢+! by adding a 1 as their 
last coordinate. So, let 


wi = (vi, 1), w2 = (va, 1), BASS Sie Wn = (Vn, 1). 
Now we define the cone over P as 
cone(P) = {A1wi + Agwe +++: +AnWn: A1,A2,---,An > OF} C Re. 


This pointed cone has the origin as apex, and we can recover our original 
polytope P (strictly speaking, the translated set {(x,1): x € P}) by cutting 
cone(P) with the hyperplane xg4; = 1, as shown in Figure 3.2. 
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T3 








Fig. 3.2. Coning over a polytope. 


By analogy with the language of polytopes, we say that the hyperplane 
H = {x € R? : a- x = b} is a supporting hyperplane of the pointed d-cone 
K if K lies entirely on one side of H, that is, 


Kc {xe Rt: a-x<b} or Kc{xeRt:a-x>bd}. 


A face of K is a set of the form KM H, where H is a supporting hyperplane 
of K. The (d — 1)-dimensional faces are called facets and the 1-dimensional 
faces edges of K. The apex of K is its unique 0-dimensional face. 

Just as polytopes can be triangulated into simplices, pointed cones can be 
triangulated into simplicial cones. So, a collection T of simplicial d-cones is a 
triangulation of the d-cone K if it satisfies: 


e K=UsS. 


SET 
e For any S1, S2 € T, S1 N S2 is a face common to both Sı and S2. 
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We say that K can be triangulated using no new generators if there 
exists a triangulation T such that the generators of any S € T are generators 
of P. 


Theorem 3.2. Any pointed cone can be triangulated into simplicial cones us- 
ing no new generators. 


Proof. This theorem is really a corollary to Theorem 3.1. Given a pointed d- 
cone K, there exists a hyperplane H that intersects K only at the apex. Now 
translate H “into” the cone, so that HNK consists of more than just one point. 
This intersection is a (d — 1)-polytope P, whose vertices are determined by 
the generators of K. Now triangulate P using no new vertices. The cone over 
each simplex of the triangulation is a simplicial cone. These simplicial cones, 
by construction, triangulate K. 














3.2 Integer-Point Transforms for Rational Cones 


We want to encode the information contained by the lattice points in a set 
S C R4. It turns out that the following multivariate generating function allows 
us to do this in an efficient way if S is a rational cone or polytope: 


og(Z) = os (21, 22,---, Zd) := 5 zm. 


meSNZ4 


The generating function og simply lists all integer points in S in a special 
form: not as a list of vectors, but as a formal sum of monomials. We call og 
the integer-point transform of S; the function og also goes by the name 
moment generating function or simply generating function of S. The integer- 
point transform og opens the door to both algebraic and analytic techniques. 


Example 3.3. As a warm-up example, consider the 1-dimensional cone K = 
[0, co). Its integer-point transform is our old friend 


0x (Zz) = 5 m= Da. 


m€[0,00)NZ m>0 

















Example 3.4. Now we consider the two-dimensional cone 
K := 0, 1) + à2(—2, 3) : A1,A2 = 0} G R? 


depicted in Figure 3.3. To obtain the integer-point transform ox, we tile K 
by copies of the fundamental parallelogram 


II := {Ai (1, 1) + à2(—2, 3) : 0< Aq, A2 < 1} C R2. 


More precisely, we translate II by nonnegative integer linear combinations 
of the generators (1,1) and (—2,3), and these translates will exactly cover 
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K. How can we list the integer points in K as monomials? Let’s first list all 
vertices of the translates of II. These are nonnegative integer combinations of 
the generators (1,1) and (—2,3), so we can list them using geometric series: 


: 1 
m _ J(,1)+k(—2,3) _ 
» a dD (1 — 2122) (1 — 27723) © 


m=j(1,1)+k(—2,3) j>0 k>0 
j,k>0 





We now use the integer points (m,n) € II to generate a subset of Z? by adding 
to (m,n) nonnegative linear integer combinations of the generators (1,1) and 
(—2,3). Namely, we let 


L (m,n) = {(m,n) ag (hy 1) t k(—2,3) : j, ke Z>o} $ 


It is immediate that KZ? is the disjoint union of the subsets L(m,n) as (m, n) 
ranges over II N Z? = {(0,0), (0,1), (0, 2), (—1, 2), (—1,3)}. Hence 


ox (2) = (1+ z2 +23 + 21123 + 27°23) > z” 
m=J(1,1)-+k(—2,3) 
Gk 20 


-1 -1 
L+ z2 +22 +z 23 +2,°23 

















(1 — 2122) (1 = zi 23) 





Tı 


Fig. 3.3. The cone K and its fundamental parallelogram. 
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Similar geometric series suffice to describe integer-point transforms for 
simplicial d-cones. The following result utilizes the geometric series in several 
directions simultaneously. 


Theorem 3.5. Suppose 
K := {Aywy + Agwe +--+ + Aawa: Ar, A2,---;Aa > OF 


is a simplicial d-cone, where w1,W2,...,waq € Z°. Then for v € RÊ, the 
integer-point transform ov+x of the shifted cone v+K is the rational function 


ov+i(Z) 
1 —2z™1) (1 — 22)... (1 — zwa)’ 





Ov+K (2) a ( 
where II is the half-open parallelepiped 
I= {A1W1 + A2Ww2 +--+ + Aawa : 0 < Ay, A2,..., Ad < Ti 


The half-open parallelepiped II is called the fundamental parallelepiped 
of K. 


Proof. In ovx (z) = X meteni z™, we list each integer point m in v+K 
as the monomial z™. Such a lattice point can, by definition, be written as 


m = v + ÀW + Aqwe +--+: + Agwa 


for some numbers Aj, A2,..., Aq > 0. Because the w;’s form a basis of RI, this 
representation is unique. Let’s write each of the A,’s in terms of their integer 
and fractional part: Ap = [A] + {Ax}. So 


m = v+({Ai} wit{A2} wot s -+{Aa} Wa)+{A1] wit|r2| Wot: ¢ +1 Aa] Wd, 
and we should note that, since 0 < {Ax} < 1, the vector 
p := v + {Ar} wi + {Ao} we +- + {Aa} wa 


is in v +I. In fact, p € Z4, since m and |; | wz are all integer vectors. Again 
the representation of p in terms of the w,’s is unique. In summary, we have 
proved that any m € v + KN Zf can be uniquely written as 





m= p+ kiwi + kawa +--+ + kawa (3.1) 


for some p € (v + II) N Zf and some integers k1, k2,...,kq > 0. On the other 
hand, let us write the rational function on the right-hand side of the theorem 
as a product of series: 


(1 — aal zwa) a 5 zP 5 7Zkiwi ane 5y gkWa 


pE(v+HI)NZ4 kı>0 ka>0 





If we multiply everything out, a typical exponent will look exactly like (3.1). 
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Our proof contains a crucial geometric idea. Namely, we tile the cone v+% 
with translates of v + II by nonnegative integral combinations of the w;’s. It 
is this tiling that gives rise to the nice integer-point transform in Theorem 3.5. 
Computationally, we therefore favor cones over polytopes due to our ability 
to tile a simplicial cone with copies of the fundamental domain, as above. 
Another reason for favoring cones over polytopes appears in Brion’s theorem 
in Chapter 9. 

Theorem 3.5 shows that the real complexity of computing the integer- 
point transform oy+ is embedded in the location of the lattice points in the 
parallelepiped v + II. 

By mildly strengthening the hypothesis of Theorem 3.5, we obtain a 
slightly easier generating function, a result we shall need in Section 3.4 and 
Chapter 4. 


Corollary 3.6. Suppose 
K := {Aywi + Awe +--+ + Aawa : Ar, A2,---,Aa > OF 


is a simplicial d-cone, where w1,W2,...,Wa € Zt, and v € R, such that the 
boundary of v+ K contains no integer point. Then 


O. K(z) = Ov+t(Z) 
v+ (1 —2™1) (1 — zw2)--. (1 — zwa)’ 





where II is the open parallelepiped 
II := {Awi + A2w2 +--+ + Aawa : 0 < Ay, A2,..-,Ad < 1}. 


Proof. The proof of Theorem 3.5 goes through almost verbatim, except that 
v+II now has no boundary lattice points, so that there is no harm in choosing 
II to be open. 














Since a general pointed cone can always be triangulated into simplicial 
cones, the integer-point transforms add up in an inclusion—exclusion manner 
(note that the intersection of simplicial cones in a triangulation is again a 
simplicial cone, by Exercise 3.2). Hence we have the following corollary. 


Corollary 3.7. Given any pointed cone 
K = {v + Awi + A2W2 + +AmWm : AIAD eis Am >0} 


with v € RÌ, w1, W2,...,Wm € Z$, the integer-point transform og(z) evalu- 
ates to a rational function in the coordinates of z. 














Philosophizing some more, one can show that the original infinite sum 
ox(z) converges only for z in a subset of C?, whereas the rational function 
that represents ox gives us its meromorphic continuation. Later, in Chapters 
4 and 9, we make use of this continuation. 
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3.3 Expanding and Counting Using Ehrhart’s Original 
Approach 


Here is the fundamental theorem concerning the lattice-point count in an 
integral convex polytope. 


Theorem 3.8 (Ehrhart’s theorem). If P is an integral convex d-polytope, 
then Lp(t) is a polynomial in t of degree d. 


This result is due to Eugene Ehrhart, in whose honor we call Lp the 
Ehrhart polynomial of P. Naturally, there is an extension of Ehrhart’s 
theorem to rational polytopes, which we will discuss in Section 3.7. 

Our proof of Ehrhart’s theorem uses generating functions of the form 
iso f(t) 2’, similar in spirit to the ones discussed in the beginning of Chap- 
ter 1. If f is a polynomial, this power series takes on a special form, which we 
invite the reader to prove (Exercise 3.8): 


Lemma 3.9. If 


Dror- 7S. 


t>0 


then f is a polynomial of degree d if and only if g is a polynomial of degree at 
most d and g(1) £0. 














The reason we introduced generating functions of the form og(z) = 
dimesnze Z™ in Section 3.2 is that they are extremely handy for lattice-point 
problems. The connection to lattice points is evident, since we are summing 
over them. If we’re interested in the lattice-point count, we simply evaluate 
og at z = (1,1,...,1): 


oi Aid $X 1®= $ 1=#(S072°). 


meSNZ4 meSNZ4 


(Here we denote by 1 a vector all of whose components are 1.) Naturally, we 
should make this evaluation only if S is bounded; Theorem 3.5 already tells 
us that it’s no fun evaluating øx (1) if K is a cone. 

But the magic of the generating function og does not stop there. To liter- 
ally take it to the next level, we cone over a convex polytope P. If P c R? has 
the vertices v1, V2,...,Vn € ZÎ, recall that we lift these vertices into R¢+, 
by adding a 1 as their last coordinate. So let 


w1 = (v1, 1), We = (vo, 1), ..., Wn = (Vn, 1). 
Then 


cone(P) = {Ayw1 + A2W2 +--: + AnWn : At, Àz,- An > 0} C RIH, 
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Recall that we can recover our original polytope P by cutting cone(P) with 
the hyperplane rg; = 1. We can recover more than just the original polytope 
in cone(P): By cutting the cone with the hyperplane za+1 = 2, we obtain a 
copy of P dilated by a factor of 2. (The reader should meditate on why this cut 
is a 2-dilate of P.) More generally, we can cut the cone with the hyperplane 
La+1 = t and obtain tP, as suggested by Figure 3.4. 





Fig. 3.4. Recovering dilates of P in cone(P). 


Now let’s form the integer-point transform 0 .one(p) of cone(P). By what we 
just said, we should look at different powers of zg+1: there is one term (namely, 
1), with 29 41, corresponding to the origin; the terms with zl 41 correspond to 
lattice points in P (listed as monomials in 21, 22,..., 2a), the terms with 23, , 
correspond to points in 2P, etc. In short, 
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OF cone(P) (21, Z233 Zd+1) 
= 1 + 0p (z,..., 24) Za41 + Oop (zind) ea ete ee 
= 1+) op (Pigs iosea) Zii 
t>1 
Specializing further for enumeration purposes, we recall that op (1,1,...,1) = 


# (P N Z), and so 


Ocone(P) (1, 1,...,1, Zd41) =1+ ys OtP (1, haah 1) zayi 
t>1 


=14+5°#(PNZ") zi. 


t>1 


But by definition, the enumerators on the right-hand side are just evaluations 
of Ehrhart’s counting function, that is, Ccone(p) (1, 1,.--,1,2a41) is nothing 
but the Ehrhart series of P: 














Lemma 3.10. Ocone(p) (1; 1,.-.,1,2) = 1 + XC Lp(t) 2 = Ehrp(z). 
t>1 


With all this machinery at hand, we can prove Ehrhart’s theorem. 


Proof of Theorem 3.8. It suffices to prove the theorem for simplices, because 
we can triangulate any integral polytope into integral simplices, using no new 
vertices. Note that these simplices will intersect in lower-dimensional integral 
simplices. 

By Lemma 3.9, it suffices to prove that for an integral d-simplex A, 


Ehra(z) = 1 +5 La(t) z= T 


for some polynomial g of degree at most d with g(1) 4 0. In Lemma 3.10 
we showed that the Ehrhart series of A equals @gone(a) (1, 1,..-, 1, Z), so let’s 
study the integer-point transform attached to cone(A). 

The simplex A has d+1 vertices v1, v2,...,Va41, and so cone(A) C R+! 
is simplicial, with apex the origin and generators 


wi = (vi, 1), w2 =(va,1), ..., Wapi = (Vapi, 1) € ZH. 
Now we use Theorem 3.5: 


OTL (21, 22,- -- , Žd+1) 
1-21) (1 — zwW2)... (1 — zWa+1) i 





Tcone(A) (z1, 22,2085 Zd+1) = ( 


where IT = {Aw + À2W2 +: + ÀAd+1Wad+1 :0< 1, A2, Bie rd+1 Š 1}. This 
parallelepiped is bounded, so the attached generating function oy is a Laurent 
polynomial in 21, 22,..., Za41- 
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We claim that the zg,1-degree of oy is at most d. In fact, since the rg41- 
coordinate of each wx is 1, the xg41-coordinate of a point in IT is Ay +Ag+---+ 
Aa+1 for some 0 < Ay, A2,.--,Ad¢1 < 1. But then Ay + Ag +--+ +Aa41 < d1, 
so if this sum is an integer it is at most d, which implies that the zgi1-degree 











of on (21, Z2,---,Zd41) is at most d. Consequently, on (1,1,...,1, za41) is a 
polynomial in zg, of degree at most d. The evaluation oy (1,1,1,...,1) of this 
polynomial at zq41 = 1 is not zero, because oy (1,1,1,...,1) = # (I N Z+!) 
and the origin is a lattice point in II. 

Finally, if we specialize ZW% to z1 = z2 =--- = zq = 1, we obtain zj,,, so 
that E a ) 

Ocone(A) (1, Ls stsegols Zd+1) T on ( —— say 
(1 — 2a41) 














The left-hand side is Ehra (za41) = 1+)0,31 La (t) 244, by Lemma 3.10. 


3.4 The Ehrhart Series of an Integral Polytope 


We can actually take our proof of Ehrhart’s theorem one step further by study- 
ing the polynomial oy (1,1,...,1, 241). As mentioned above, the coefficient 
of z" +ı simply counts the integer points in the parallelepiped II cut with the 
hyperplane xg41 = k. Let us record this. 


Corollary 3.11. Suppose A is an integral d-simplex with vertices v1, V2,... 
Va+1, and let wj = (vj,1). Then 


— haz? + haz% +: + hi z+ ho 


Ehra (z) = 1+ 5 La (t) z* (0-A ’ 


t>1 





where hy equals the number of integer points in 


{Awi + Agwe +-+ + Agt1 Way: O< Ai, A2,---, Adya < 1} 








with last variable equal to k. 





This result can actually be used to compute Ehra, and therefore the 
Ehrhart polynomial, of an integral simplex A in low dimensions very quickly 
(a fact that the reader may discover in some of the exercises). We remark, 
however, that things are not as simple for arbitrary integral polytopes. Not 
only is triangulation a nontrivial task in general, but one would also have to 
deal with overcounting where simplices of a triangulation meet. 

Corollary 3.11 implies that the numerator of the Ehrhart series of an inte- 
gral simplex has nonnegative coefficients, since those coefficients count some- 
thing. Although the latter cannot be said of the coefficients of the Ehrhart 
series of a general integral polytope, the nonnegativity property magically 
survives. 
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Theorem 3.12 (Stanley’s nonnegativity theorem). Suppose P is an in- 
tegral convex d-polytope with Ehrhart series 


= haz? + ha-12% 1 +--+ ho 





Ehrp(z) = (jae 
Then ho, hi,. . -ha = 0. 
Proof. Triangulate cone(P) C R¢*! into the simplicial cones K1,K2,...,Km. 


Now Exercise 3.14 ensures that there exists a vector v € R+! such that 
cone(P) N Z? = (v + cone(P)) N Z? 


(that is, we neither lose nor gain any lattice points when shifting cone(P) by 
v) and neither the facets of v + cone(P) nor the triangulation hyperplanes 
contain any lattice points. This implies that every lattice point in v + cone(P) 
belongs to exactly one simplicial cone v + K;: 


cone(P) N Z? = (v + cone(P)) N Z? = Ù ((v+K;)NZ*), (3.2) 


j=l 
and this union is a disjoint union. If we translate the last identity into 
generating-function language, it becomes 


m 
Ocone(P) (z1, Z233 Zd+1) = X Ov+K; (z1, 22,004 Zd+1) : 
j=l 


But now we recall that the Ehrhart series of P is just a special evaluation of 
Fcone(p) (Lemma 3.10): 


Ehrp(z) = deone(P) (1, 1,- 1,2) = XC oye (1,1,1,2). (3.3) 

j=1 
It suffices to show that the rational generating functions ov+x; (1, 1,...,1, z) 
for the simplicial cones v + K; have nonnegative numerator. But this fact 
follows from evaluating the rational function in Corollary 3.6 at (1,1,...,1,z). 














This proof shows a little more: Since the origin is in precisely one simplicial 
cone on the right-hand side of (3.2), we get on the right-hand side of (3.3) 
precisely one term that contributes 1/(1 — z)¢*+ to Ehrp; all other terms 
contribute to higher powers of the numerator polynomial of Ehrp. That is, 
the constant term hg equals 1. The reader might feel that we are chasing our 
tail at this point, since we assumed from the very beginning that the constant 
term of the infinite series Ehrp is 1, and hence ho has to be 1, as a quick look 
at the expansion of the rational function representing Ehrp shows. The above 
argument shows merely that this convention is geometrically sound. Let us 
record this: 
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Lemma 3.13. Suppose P is an integral convex d-polytope with Ehrhart series 


haz? + hg_-1 2471 +++ + ho 


Ehrp(z) = (2) 








Then ho = 1. 











For a general integral polytope P, the reader has probably already discov- 
ered how to extract the Ehrhart polynomial of P from its Ehrhart series: 


Lemma 3.14. Suppose P is an integral convex d-polytope with Ehrhart series 





haz? + ha-12 1+- +hiz+1 
Ehrp(2) =1+ > 7 Lp(t) 2! = — ET i l 


t>1 


Then 


Lp(t) = ey h(t tetra g ) Htela) 


Proof. Expand into a binomial series: 


haz? + ha-12 t+- +hiz+1 
Ehrp(z) = (1 — z) 


t+d 
a d d—1 t 
= (haz + ha-ı z tote) Je 


t+d t+d 
wy (Me hE (Mee 


t>0 t>0 


+h) Gy Ga. 


t>0 t>0 


= had (i )e + has 5 (ite 


t>d t>d—1 


+h > GPTD EESD ea. 


t>1 t>0 





In all infinite sums on the right-hand side, we can start the index t with 0 
without changing the sums, by the definition (2.1) of the binomial coefficient. 
Hence 


Ehrp(z 


) 
gU een) (39) 
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The representation of the polynomial Lp(t) in terms of the coefficients 
of Ehrp can be interpreted as the Ehrhart polynomial expressed in the basis 
(5), (1), x e (2) (see Exercise 3.9). This representation is very useful, as 
the following results show. 


Corollary 3.15. If P is an integral convex d-polytope, then the constant term 
of the Ehrhart polynomial Lp is 1. 


Proof. Use the expansion of Lemma 3.14. The constant term is 


mos (Jan e = 


This proof is exciting, because it marks the first instance where we extend 
the domain of an Ehrhart polynomial beyond the positive integers, for which 
the lattice-point enumerator was initially defined. More precisely, Ehrhart’s 
theorem (Theorem 3.8) implies that the counting function 














Lp(t) = #(tPAZ?), 


originally defined for positive integers t, can be extended to all real or even 
complex arguments t (as a polynomial). A natural question arises: are there 
nice interpretations of Lp(t) for arguments t that are not positive integers? 
Corollary 3.15 gives such an interpretation for t = 0. In Chapter 4, we will 
give interpretations of Lp(t) for negative integers t. 


Corollary 3.16. Suppose P is an integral convex d-polytope with Ehrhart se- 
ries 
haz? thas1ee +--+ +h ztl 
Ehrp(z) = (aye i 


Then hı = Lp(1) -d-1=#(PNZ*) —d-1. 





Proof. Use the expansion of Lemma 3.14 with t = 1: 


d+1 d 2 1 
tx(t) = ( i Jtag) thag) Hrag) <a ba tha. 


The proof of Corollary 3.16 suggests that there are also formulas for 
h2,h3,... in terms of the evaluations Lp(1), Lp(2),..., and we invite the 
reader to find them (Exercise 3.10). 

A final corollary to Theorem 3.12 and Lemma 3.14 states how large the 
denominators of the Ehrhart coefficients can be: 














Corollary 3.17. Suppose P is an integral polytope with Ehrhart polynomial 
Lp(t) = catt +ca-1 te 14+---+e,t+1. Then all coefficients satisfy d! cp € Z. 
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Proof. By Theorem 3.12 and Lemma 3.14, 


t+d t+d-1 t+1 t 
teo) = ( d )+a( d ) tot heal d ) trafa); 


where the h,’s are integers. Hence multiplying out this expression yields a 
polynomial in t whose coefficients can be written as rational numbers with 
denominator d!. 














We finish this section with a general result that gives relations between 
negative integer roots of a polynomial and its generating function. This the- 
orem will become handy in Chapter 4, in which we find an interpretation for 
the evaluation of an Ehrhart polynomial at negative integers. 


Theorem 3.18. Suppose p is a degree-d polynomial with the rational gener- 
ating function 

















=n -za 
Then ha = ha-1 = +++ = hk41 = 0 and hp # 0 if and only if p(—1) = p(—2) = 
sep AA =0 and p(—(d—k+ 1) £0. 
Proof. Suppose hg = ha-1 = ++: = hey, = 0 and hy Æ 0. Then the proof of 








Lemma 3.14 gives 


t+d TE LT ae 
O A totaal? ; em (OS ). 


All the binomial coefficients are zero for t = —1, —2,...,—d + k, so those are 
roots of p. On the other hand, all binomial coefficients but the last one are 
zero for t = —d + k — 1, and since hk 40, —d + k — 1 is not a root of p. 

Conversely, suppose p(—1) = p(—2) = --- = p(—(d—k)) = 0 and 
p(—(d—k+1)) #0. The first root —1 of p gives 


o= =t r mS G7) al the) =e( 


so we must have hg = 0. The next root —2 forces hg_1 = 0, and so on, up to 
the root —d + k, which forces h41 = 0. It remains to show that hy, #4 0. But 
if hy were zero then, by a similar line of reasoning as in the first part of the 
proof, p(—d + k — 1) = 0, a contradiction. 

















3.5 From the Discrete to the Continuous Volume 
of a Polytope 


Given a geometric object S C Rf, its volume, defined by the integral vol S := 
Ís dz, is one of the fundamental data of S. By the definition of the integral, say 
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in the Riemannian sense, we can think of computing vol S by approximating 
S with d-dimensional boxes that get smaller and smaller. To be precise, if 
we take the boxes with side length 1/t then they each have volume 1/t¢. We 
might further think of the boxes as filling out the space between grid points in 


the lattice EZ)’. This means that volume computation can be approximated 


by counting boxes, or equivalently, lattice points in (4z)*: 


lS = li g SNA LN 
ea T : 


It is a short step to counting integer points in dilates of S, because 
1.\4 
# (se (42) = #(tSNZ?*). 


Lemma 3.19. Suppose S C R? is d-dimensional. Then 


Let us summarize: 














1 
vol $= lim 7: # (#59 Z*). 


We emphasize here that S is d-dimensional, because otherwise (since S could 
be lower-dimensional although living in d-space), by our current definition 
vol S = 0. (We will extend our volume definition in Chapter 5 to give nonzero 
relative volume to objects that are not full-dimensional.) 

Part of the magic of Ehrhart’s theorem lies in the fact that for an integral 
d-polytope P, we do not have to take a limit to compute volP; we need to 
compute “only” the d+ 1 coefficients of a polynomial. 


Corollary 3.20. Suppose P C R? is a integral convex d-polytope with Ehrhart 
polynomial ca tt + cy_-1 te +--+ qt+1. Then ca = vol P. 


Proof. By Lemma 3.19, 








ti me ee t+1 
volP = lim Cd + Cq 1 + + Cy + = 


— CO td 








Cd - 





On the one hand, this should not come as a surprise, because counting 
integer points in some object should grow roughly like the volume of the 
object as we make it bigger and bigger. On the other hand, the fact that we 
can compute the volume as one term of a polynomial should be very surprising: 
the polynomial is a counting function and as such is something discrete, yet by 
computing it (and its leading term), we derive some continuous data. Even 
more, we can—at least theoretically—compute this continuous datum (the 
volume) of the object by calculating a few values of the polynomial and then 
interpolating; this can be described as a completely discrete operation! 

We finish this section by showing how to retrieve the continuous volume 
of an integer polytope from its Ehrhart series. 
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Corollary 3.21. Suppose P C R? is an integral convex d-polytope, and 


haz? + ha-12 1+- +hiz+1 
Ehrp(z) = (ayer ; 





1 
Then vol P = | (ha +ha-1 +++ + hi +1). 


Proof. Use the expansion of Lemma 3.14. The leading coefficient is 














1 
q (hat haa + thi +1). 


3.6 Interpolation 


We now use the polynomial behavior of the discrete volume Lp of an integral 
polytope P to compute the continuous volume vol? and the discrete volume 
Lp from finite data. 

Two points uniquely determine a line. There exists a unique quadratic 
passing through any three given points. In general, a degree-d polynomial 
p is determined by d+ 1 points (x, p(x)) € R?. Namely, evaluating p(x) = 


Caxt + ca-i! +--+ + co at distinct inputs £1, £2,..., a41 gives 
p (x1) Ca 
p (x2) Cd-1 
. =V 3 (3.4) 
p (£a+1) Co 
where 
xrl a . xı 1 
d „d-1 
Doni t 1 
v- 2 z 
sia ahida 
so that 
Ca p (x1) 
Cd-1 P (x2) 
=v! ; : (3.5) 
Co p (&a41) 


(Exercise 3.16 makes sure that V is invertible.) The identity (3.5) gives the 
famous Lagrange interpolation formula. 

This gives us an efficient way to compute Dp, at least when dim? is not 
too large. The continuous volume of P will follow instantly, since it is the 
leading coefficient cq of Lp. In the case of an Ehrhart polynomial Lp, we 
know that Lp(0) = 1, so that (3.4) simplifies to 
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Lp (zı)— 1 me gl EE Ca 
Lp (x2) —1 ge af} ete Bo Cd-1 
Lp (xq) —1 at oo sis Ld C1 


Example 3.22 (Reeve’s tetrahedron). Let 7p be the tetrahedron with 
vertices (0,0,0), (1,0,0), (0,1,0), and (1,1,h), where h is a positive integer 
(see Figure 3.5). 
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Sa ese cece none as Dawns e = 


Fig. 3.5. Reeve’s tetrahedron 7p (and 27;). 
To interpolate the Ehrhart polynomial Lz, (t) from its values at various 
points, we use Figure 3.5 to deduce the following: 


4 = Lr, (1) = vo (Th) teg toatl, 
h+9= Lr, (2) = vol (Th) 2 + c2: +e -241. 


Using the volume formula for a pyramid, we know that 
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1 ; h 
vol (Tn) = z (base area) (height) = = 


Thus h + 1 = h + 2c2 — 1, which gives us cg = 1 and cı = 2 — k, Therefore 














h h 
Int) =e eA (2-Z) e+. 


3.7 Rational Polytopes and Ehrhart Quasipolynomials 


We do not have to change much to study lattice-point enumeration for rational 
polytopes, and most of this section will consist of exercises for the reader. The 
structural result paralleling Theorem 3.8 is as follows. 


Theorem 3.23 (Ehrhart’s theorem for rational polytopes). If P is a 
rational convex d-polytope, then Lp(t) is a quasipolynomial in t of degree 
d. Its period divides the least common multiple of the denominators of the 
coordinates of the vertices of P. 


We will call the least common multiple of the denominators of the co- 
ordinates of the vertices of P the denominator of P. Theorem 3.23, also 
due to Ehrhart, extends Theorem 3.8, because the denominator of an integral 
polytope P is one. Exercises 3.21 and 3.22 show that the word “divides” in 
Theorem 3.23 is far from being replaceable by “equals.” 

We start the path toward a proof of Theorem 3.23 by stating the analogue 
of Lemma 3.9 for quasipolynomials (see Exercise 3.19): 


Lemma 3.24. If 





t g(z) 
dL OE Fe 


t>0 


then f is a quasipolynomial of degree d with period dividing p if and only if g 
and h are polynomials such that deg(g) < deg(h), all roots of h are p'” roots 
of unity of multiplicity at most d+ 1, and there is a root of multiplicity equal 
tod+1 (all of this assuming that g/h has been reduced to lowest terms). 














Our goal is now evident: we will prove that if P is a rational convex d- 
polytope with denominator p, then 


Ehrp(z) =1 + 5° Lp(t) z= e 


for some polynomial g of degree less than p(d + 1). As in Section 3.3, we will 
have to prove this only for the case of a rational simplex. So suppose the d- 
simplex A has vertices v1, Vv2,...,Va+1 € Qf, and the denominator of A is p. 
Again we will cone over A: let 
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w1 = (v1, 1), we = (vo, 1),.-., Wasi = (Va41, 1); 
then 
cone(A) = {Ar wy + Agw2 +-+- + Addi Wasi: At, A2,---,;Aadt1 > 0} C Re 


To be able to use Theorem 3.5, we first have to ensure that we have a descrip- 
tion of cone(A) with integral generators. But since the denominator of A is p, 
we can replace each generator w by pw, € Z’+!, and we're ready to apply 
Theorem 3.5. From this point, the proof of Theorem 3.23 proceeds exactly like 
that of Theorem 3.8, and we invite the reader to finish it up (Exercise 3.20). 

Although the proofs of Theorem 3.23 and Theorem 3.8 are almost identical, 
the arithmetic structure of Ehrhart quasipolynomials is much more subtle and 
less well known than that of Ehrhart polynomials. 


3.8 Reflections on the Coin-Exchange Problem 
and the Gallery of Chapter 2 


At this point, we encourage the reader to look back at the first two chapters in 
light of the basic Ehrhart-theory results. Popoviciu’s theorem (Theorem 1.5) 
and its higher-dimensional analogue give a special set of Ehrhart quasipoly- 
nomials. On the other hand, in Chapter 2 we encountered many integral 
polytopes. Ehrhart’s theorem (Theorem 3.8) explains why their lattice-point 
enumeration functions were all polynomials. 


Notes 


1. Triangulations of polytopes and manifolds are an active source of research 
with many interesting open problems; see, e.g., [69]. 


2. Eugene Ehrhart laid the foundation for the central theme of this book in 
the 1960s, starting with the proof of Theorem 3.8 in 1962 [79]. The proof we 
give here follows Ehrhart’s original lines of thought. An interesting fact is that 
he did his most beautiful work as a teacher at a lycée in Strasbourg (France), 
receiving his doctorate at age 60 on the urging of some colleagues. 


3. Given any d linearly independent vectors w1,W2,...,Wa € Rf, the lat- 
tice generated by them is the set of all integer linear combinations of 
W1, W2,..., Wq. Alternatively, one can define a lattice as a discrete subgroup 
of R¢, and these two notions can be shown to be equivalent. One might won- 
der whether replacing the lattice Z? by an arbitrary lattice £ throughout the 
statements of the theorems—requiring now that the vertices of a polytope be 
in £—gives us any different results. The fact that the theorems of this chapter 
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remain the same follows from the observation that any lattice can be mapped 
to Zi by an invertible linear transformation. 


4. Richard Stanley developed much of the theory of Ehrhart (quasi-)polyno- 
mials, initially from a commutative-algebra point of view. Theorem 3.12 is 
due to him [170]. The proof we give here appeared in [30]. An extension of 
Theorem 3.12 was found by Takayuki Hibi; he proved that if hg > 0 then 
hy > hı for all 1 < k < d — 1 (using the notation of Theorem 3.12) [98]. 


5. The tetrahedron 7, of Example 3.22 was used by John Reeve to show that 
Pick’s theorem does not hold in R (see Exercise 3.18) [154]. Incidentally, the 
formula for Lz, also proves that the coefficients of an Ehrhart polynomial (of 
a closed polytope) are not always positive. 


6. There are several interesting questions (some of which are still open) regard- 
ing the periods of Ehrhart quasipolynomials. Some particularly nice examples 
about what can happen with periods were given by Tyrrell McAllister and 
Kevin Woods [126]. 


7. Most of the results remain true if we replace “convex polytope” by “poly- 
topal complex,” that is, a finite union of polytopes. One important exception 
is Corollary 3.15: the constant term of an “Ehrhart polynomial” of an integral 
polytopal complex C is the Euler characteristric of C. 


8. The reader might wonder why we do not discuss polytopes with irrational 
vertices. The answer is simple: nobody has yet found a theory that would par- 
allel the results in this chapter, even in dimension two. One notable exception 
is [11], in which irrational extensions of Brion’s theorem are given; we will 
study the rational case of Brion’s theorem in Chapter 9. On the other hand, 
Ehrhart theory has been extended to functions other than strict lattice-point 
counting; one instance is described in Chapter 11. 


Exercises 
3.1. To any permutation 7 € Sq on d elements, we associate the simplex 
A, := conv {0, e71); €701) + €r(2) +++ €101) + en(2) H + en(a) t, 


where e1,€2,...,@q denote the unit vectors in R. 


(a) Prove that {A, : 7 € Sa} is a triangulation of the unit d-cube [0, 1]¢. 

(b) Prove that all A, are congruent to each other, that is, each one can be 
obtained from any other by reflections, translations, and rotations. 

(c) Show that for all 7 € Sa, La, (t) = ($°). 
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3.2. & Suppose T is a triangulation of a pointed cone. Prove that the inter- 
section of two simplicial cones in T is again a simplicial cone. 


3.3. Find the generating function ox (z) for the following cones: 


(a) K= {A1 (0, 1) + A2(1, 0) i 1, A2 > 0}; 
(b) K = {A1 (0, 1) T à2(l, 1) H 1, A2 > 0}; 
(c) K= {(3, 4) + A1(0, 1) + r2(2, 1) 5 1, A2 = 0}. 





3.4. % Let S C R” and T C R”. Show that osxrT (21, 22,---,2mtn) = 
os (21,223: --; Zm) Or (emitem ee ere 


3.5. $% Let K be a rational d-cone, and let m € Z%. Show that om4x(z) = 
Z™on(Z). 


3.6. & For a set S C R, let —S := {-x: x € S}. Prove that 


1 1 1 
O_g (21, Z2,- --, Zd) = 0s PT oe S pte 


Z1 22 Zd 
3.7. Given a pointed cone K C R? with apex at the origin, let S := K N Zt. 
Show that if x,y € S then x+y € S. (In algebraic terms, S is a semi- 


group, since 0 € S and associativity of the addition in S follows trivially from 
associativity in R.) 





3.8. & Prove Lemma 3.9: If 


g{2) 
Tye = a. 
t>0 
then f is a polynomial of degree d if and only if g is a polynomial of degree 
at most d and g(1) Æ 0. 


3.9. Prove that (7); (R); ...,(%) is a basis for the vector space Pol, of 


n 
polynomials (in the variable x) of degree less than or equal to n. 


3.10. For a polynomial p(t) = catt + ca_1t* 1 +- -+ co, let Hp(z) be defined 
by 
H(z 
>) CEEPUS 
t>0 
Consider the map ġa: Polą — Pola, pt> Hp. 


(a) Show that ¢q is a linear transformation. 
(b) Compute the matrix describing ġa for d = 0, 1,2,.... 
(c) Deduce formulas for hz, hg, ..., similar to the one in Corollary 3.16. 


3.11. Compute the Ehrhart polynomials and the Ehrhart series of the sim- 
plices with the following vertices: 
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(a) ( 
(b) ( 


3.12. Define the hypersimplex A(d,k) as the convex hull of 


, 0, 0), (1, 0,0), (0, 2, 0), and (0, 0, 3); 
,0, 0,0), (1,0, 0,0), (0, 2, 0, 0), (0, 0, 3, 0), and (0, 0,0, 4). 


oo 


{ej + ej hee + Eezy : 1<jı < j2 <: < jk <d}, 


where e1, €2, . . . , €q are the standard basis vectors in R?. For example, A(d, 1) 
and A(d,d—1) are regular (d—1)-simplices. Compute the Ehrhart polynomial 
and the Ehrhart series of A(d, k). 


3.13. & Suppose H is the hyperplane given by 
H={xeR?: a, 2 + agz2 +++: +agxq = 0} 


for some a1, @2,...,@q E Z, which we may assume to have no common factor. 
Prove that there exists v € Z? such that Uez ((nv + H) N Zt) = Z°. (This 
implies, in particular, that the points in Z? \ H are all at least some minimal 
distance away from H; this minimal distance is essentially given by the dot 
product of v with (aj, a@2,...,@a).) 


3.14. & A hyperplane H is rational if it can be written in the form 
H= {xeER*: azı + a283 +--+ + agta =b} 


for some a1,@2,...,aq,) € Z. A hyperplane arrangement in R? is a finite 
set of hyperplanes in Rt. Prove that a rational hyperplane arrangement H 
can be translated so that no hyperplane in H contains any integer points. 


3.15. The conclusion of the previous exercise can be strengthened: Prove that 
a rational hyperplane arrangement H can be translated such that no hyper- 
plane in H contains any rational points. 


3.16. Show that, given distinct numbers x, £2,...,£q+1, the matrix 
a? r9! ©. xı 1 
xf woe ©: z2 1 
V = 
d d-1 
Tayi Tayi Tati 1 


is not singular. (V is known as the Vandermonde matriz.) 


3.17. Let P be an integral d-polytope. Show that 


d 
volP = = (n! +5 3 Coet) . 
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3.18. As in Example 3.22, let 7„ be the tetrahedron with vertices (0,0, 0), 
(1,0,0), (0,1,0), and (1,1,n), where n is a positive integer. Show that the 
volume of 7, is unbounded as n — oo, yet for all n, Zn has no interior and 
four boundary lattice points. This example proves that Pick’s theorem does 
not hold for a three-dimensional integral polytope P, in the sense that there 
is no linear relationship among vol P, Lp(1), and Lpe(1). 


3.19. & Prove Lemma 3.24: If 


fs = 


t>0 


g(z) 
A(z)’ 





then f is a quasipolynomial of degree d with period dividing p if and only if g 
and h are polynomials such that deg(g) < deg(h), all roots of h are p*® roots 
of unity of multiplicity at most d+1, and there is a root of multiplicity equal 
to d+ 1 (all of this assuming that g/h has been reduced to lowest terms). 


3.20. & Provide the details for the proof of Theorem 3.23: If P is a rational 
convex d-polytope, then Lp(t) is a quasipolynomial in t of degree d. Its period 
divides the least common multiple of the coordinates of the vertices of P. 


3.21. Let 7 be the rational triangle with vertices (0,0), (1, vt), and (p,0), 


where p is a fixed integer > 2. Show that Lr(t) = % t? + tt + 1; in 
particular, Ly is a polynomial. 





3.22. Prove that for any d > 2 and any p > 1, there exists a d-polytope P 
whose Ehrhart quasipolynomial is a polynomial (i.e., it has period 1), yet P 
has a vertex with denominator p. 


3.23. Prove that the period of the Ehrhart quasipolynomial of a 1-dimensional 
polytope is always equal to the lcm of the denominators of its vertices. 


3.24. Let T be the triangle with vertices (—4, —ż), (5, —4), and (0, 3). Show 
that Lr (t) = t? + c(t)t+ 1, where 


1 if t is even, 
c(t) = a 
0 if t is odd. 


(This shows that the periods of the “coefficients” of an Ehrhart quasipoly- 
nomial do not necessarily increase with decreasing power.) Find the Ehrhart 
series of T. 


3.25. Prove the following extension of Theorem 3.12: Suppose P is a rational 
d-polytope with denominator p. Then 


Ehrp(z) = erase 


where f is a polynomial with nonnegative integral coefficients. 
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3.26. Find and prove a statement that extends Lemma 3.14 to Ehrhart qua- 
sipolynomials. 


3.27. Prove the following extension of Corollary 3.15 to rational polytopes. 
Namely, the Ehrhart quasipolynomial Lp of the rational convex polytope 
P CR? satisfies Lp(0) = 1. 


3.28. Prove the following analogue of Corollary 3.17 for rational polytopes: 
Suppose P is a rational polytope with Ehrhart quasipolynomial Lp(t) = 
ca(t) t? + cq_1(t) t2-1 +--+ + e1(t)t + colt). Then for allt € Z and0<k <d, 
we have d! c,(t) € Z. 


3.29. & Prove that Corollary 3.20 also holds for rational polytopes: Sup- 
pose P C Rê is a rational convex d-polytope with Ehrhart quasipolynomial 
ca(t) t? + ca_1(t) t! + --- + colt). Then calt) equals the volume of P; in 
particular, cq(t) is constant. 


3.30. Suppose P is a rational convex polytope. Show that as rational func- 
tions, 


Ehrzp(z) = 5 (Bhrp (vZ) — Ehrp (—2)). 


3.31. Suppose f and g are quasipolynomials. Prove that the convolution 


t 


F(t) := J f(s) g(t - 8) 


s=0 


is also a quasipolynomial. What can you say about the degree and the period 
of F, given the degrees and periods of f and g? 


3.32. Given two positive, relatively prime integers a and 8, let 


no {i if alt, A w= if blt, 


0 otherwise, 0 otherwise. 
Form the convolution of f and g. What function is it? 


3.33. Suppose P C R™ and Q C R” are rational polytopes. Prove that the 
convolution of Lp and Lg equals the Ehrhart quasipolynomial of the polytope 
given by the convex hull of P x {0,} x {0} and {0m} x Q x {1}. Here Og 
denotes the origin in R. 


3.34. We define the unimodular group SL, (Z) as the set of all dxd matrices 
with integer entries and determinant +1. 





(a) Show that SLa (Z) acts on the integer lattice Z as a one-to-one, onto 
map. That is, fix any A € SLa (Z). Then A maps Zê to itself in a bijective 
fashion. 
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(b) For any open simplex A° C R? and any A € SL, (Z), consider the image 
of A° under A, defined by A (A°) := {Ax: x € A°}. Show that 


#{4° AZI} = # {A (4°) NZ. 


(c) Let P be an integral polytope, and let Q := A (P), where A € SLa (Z), 
so that P and Q are unimodular images of each other. Show that Lp (t) = 
Lo(t). (Hint: Write P as the disjoint union of open simplices.) 


3.35. Search on the Internet for the program LattE: Lattice-Point Enumera- 
tion [66, 115]. You can download it for free. Experiment. 


Open Problems 


3.36. How many triangulations are there for a given polytope? 


3.37. What is the minimal number of simplices needed to triangulate the unit 
d-cube? (These numbers are known for d < 7.) 


3.38. Classify the polynomials of a fixed degree d that are Ehrhart polyno- 
mials. This is completely done for d = 2 [160] and partially known for d = 3 
and 4 [24, Section 3]. 


3.39. Study the roots of Ehrhart polynomials of integral polytopes in a fixed 
dimension [24, 37, 42, 94]. Study the roots of the numerators of Ehrhart series. 


3.40. Come up with an efficient algorithm that computes the period of an 
Ehrhart quasipolynomial. (See [188], in which Woods describes an efficient 
algorithm that checks whether a given integer is a period of an Ehrhart qua- 
sipolynomial.) 


3.41. Suppose P and Q are integer polytopes with the same Ehrhart poly- 
nomial, that is, Lp(t) = Lo(t). What additional conditions on P and Q do 
we need to ensure that integer translates of P and Q are unimodular images 
of each other? That is, when is Q = A (P) + m for some A € SLa (Z) and 
m € Z4? 


3.42. Find an “Ehrhart theory” for irrational polytopes. 


A 





Reciprocity 


In mathematics you don’t understand things. You just get used to them. 


John von Neumann (1903-1957) 


While Exercise 1.4 (i) gave us the elementary identity 


t-1 —t 
| | - | | 1 (4.1) 
a a 
for t € Zand a € Zyo, this fact is a special instance of a more general theme. 


Namely, (4.1) marks the simplest (one-dimensional) case of a reciprocity the- 
orem that is central to Ehrhart theory. Let Z := [0,1/a] C R, a rational 








Fig. 4.1. Lattice points in tZ. 


1-polytope (see Figure 4.1). Its discrete volume is (recalling Exercise 1.3) 


Tee + i 


The lattice-point enumerator for the interior T° 
is 


Or = 


(see Exercise 4.1). The identity (4.1) says that algebraically, 


(0, 1/a), on the other hand, 





(4.2) 


Lre (t) = —Lz(—t). 
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This chapter is devoted to proving that a similar identity holds for rational 
polytopes in any dimension: 


Theorem 4.1 (Ehrhart—Macdonald reciprocity). Suppose P is a convex 
rational polytope. Then the evaluation of the quasipolynomial Lp at negative 
integers yields 

Lp(—t) = (—1)9™? Lpo (t). 


This theorem belongs to a class of famous reciprocity theorems. A common 
theme in combinatorics is to begin with an interesting object P, and 


1. define a counting function f(t) attached to P that makes physical sense 
for positive integer values of t; 

2. recognize the function f as a polynomial in t; 

3. substitute negative integral values of t into the counting function f, and 
recognize f(—t) as a counting function of a new mathematical object Q. 


For us, P is the closure of a polytope and Q is its interior. 


4.1 Generating Functions for Somewhat Irrational Cones 


Our approach to proving Theorem 4.1 parallels the steps of Chapter 3: we 
deduce Theorem 4.1 from an identity for rational cones. We start with a 
reciprocity theorem for simplicial cones. 


Theorem 4.2. Fiz linearly independent vectors w1, W2,..., Wa € ZÊ, and let 
K = {iwi + A2w2 +- + Aawa : A1,..-,Aa > O}, the simplicial cone gener- 
ated by the w;’s. Then for those v € R? for which the boundary of the shifted 
simplicial cone v + K contains no integer point, 





1 1 1 
ov ( bet j= 1) oa Cig tap irra): 


? ? 3 
Z1 22 Zd 


Remark. This theorem is meaningless on the level of formal power series; 
however, the identity holds at the level of rational functions. We will establish 
that oy+x is a rational function in the process of proving the theorem. 


Proof. As in the proofs of Theorem 3.5 and Corollary 3.6, we have the formula 


pier Ov+t(Z) 
Oven ( ) > (1 — 21) (1 — zW2). sy (1 — zwa) , 





where II is the open parallelepiped 
I = {Aywi + Agwe +--+ + Aawa : 0< A, A2,.++5Ad < 1}. (4.3) 


This also proves that oy+ is a rational function. Note that by assumption, 
v + II contains no integer points on its boundary. Naturally, 
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o-v+m(Z) 
1—2z™1) (1 — 22)... (1 — zwa)’ 





o-v+K(Z) = ( 
so we need to relate the parallelepipeds v + II and —v + II. This relation is 
illustrated in Figure 4.2 for the case d = 2; the identity for general d is (see 
Exercise 4.2) 


v= -(-v +I) we we ea (4.4) 


Now we translate the geometry of (4.4) into generating functions: 




















-v +I — (—-v +I) (-v +I) + wi + we 
Fig. 4.2. From —v+ II tov + II. 


Wi gw? j Wa 


ov+ (2) = O_(-v4m (2) Z 


=. iL 1 1 Wi,W2,,.7Wd 
=ovin(2,2,..,2)2 Z Z 


Z 


(the last equation follows from Exercise 3.6). Let’s abbreviate the vector 


1 1 1 1 “hes 3 ‘ a 
(4, A 1) by 5. Then the last identity is equivalent to 





1 ae 4 
Ov4Tl ( = O_vym(Z) BZ MP 2, 


whence 
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salt Ys avan (2) 
y+ z) (ae ay Loe ee eh mae) 
O-v4m(z) Zz Wz W2.. g7 W4 


Q02) (z)... (zwa) 














= o_v+m(Z) 
(z¥1 — 1) (2w2 — 1) --- (zwa — 1) 
= ( 1)? o_v+m(Z) 


(1 — 21) (1—22)--- (1 — zwa) 
= (1) o-v4k(2). 














4.2 Stanley’s Reciprocity Theorem for Rational Cones 


For the general reciprocity theorem for cones, we patch the simplicial cones of a 
triangulation together, in a manner very similar to our proof of Theorem 3.12. 


Theorem 4.3 (Stanley reciprocity). Suppose K is a rational d-cone with 
the origin as apex. Then 


1 1 1 
ax ( , ee) Je 1)? oxe (21, 22,---52a)- 





Zi 22 Žad 


Proof. Triangulate K into the simplicial cones K1, K2,...,Km. Now Exer- 
cise 3.14 ensures that there exists a vector v € R? such that the shifted cone 
v + K contains exactly the interior lattice points of K, 


K? AZ? = (v +K) NZ$, (4.5) 


and there are no boundary lattice points on any of the triangulation cones: 


Av+tK,;)NZ =o for all j =1,...,m, (4.6) 
as well as 
ð (v +K) NZ? =Ø for all j =1,...,m. (4.7) 
We invite the reader (Exercise 4.3) to realize that (4.5)-(4.7) imply 
KAZ? = (-v +K) NAZ’. (4.8) 


Now by Theorem 4.2, 


m 
OK (2) = = o-v+x ( 2s O_v+K; = (-1 1 ogi; (z) 


= (-1)% ov4K w) = (-1)" ox: (2). 


Note that the second and fourth equalities are true because of the validity of 
(4.7) and (4.6), respectively. 
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4.3 Ehrhart—Macdonald Reciprocity for Rational 
Polytopes 


In preparation for the proof of Theorem 4.1, we define the Ehrhart series 
for the interior of the rational polytope P as 


Ehrpo (z =. Lpo(t 
t>1 


Our convention of beginning the series with t = 1 stems from the fact that 
this generating function is a special evaluation of the integer-point transform 
of the open cone (cone(P))°: Much in sync with Lemma 3.10, we have 


Bhrpo(z) = O(cone(p)) (1,1,...,1, 2). (4.9) 
We are now ready to prove the Ehrhart-series analogue of Theorem 4.1. 


Theorem 4.4. Suppose P is a convex rational polytope. Then the evaluation 
of the rational function Ehrp at 1/z yields 


Ehrp C) = (-1)9™ P+ Ehrpo(z). 
z 


Proof. Suppose P is a d-polytope. We recall Lemma 3.10, which states that 
the generating function of the Ehrhart polynomial of P is an evaluation of 
the generating function of cone(P): 


Ehrp(z = So Lp(t) zt = Ocone(P) (1, 1,- -, 1,2). 
t>0 


Equation (4.9) above gives the analogous evaluation of o(cone(p))° that yields 
Ehrpo. Now we apply Theorem 4.3 to the (d + 1)-cone K = cone(P): 














1 
F(cone(P))° (1, 1,...,1, z) = (-1)4? Ocone(P) (i 1,...,1, =) : 


Theorem 4.1 now follows like a breeze. 


Proof of Ehrhart-Macdonald reciprocity (Theorem 4.1). We first apply Exer- 
cise 4.5 to the Ehrhart series of P: namely, as rational functions, 


Bhrp ($ Js Don Be 
t<0 t>1 
Now we combine this identity with Theorem 4.4 to obtain 
X Lpe(t) 2t = (-1)"' Ehrp C =e Lye Se Lp(— 
t>1 t>1 


Comparing the coefficients of the two power series yields the reciprocity the- 
orem. 
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With Ehrhart—Macdonald reciprocity, we can now restate Theorem 3.18 
in terms of Ehrhart polynomials: 


Theorem 4.5. Suppose P is an integral d-polytope with Ehrhart series 


haztthe12e t+: +h etl 
Ehrp(z) = (nan ; 








Then ha = ha-1 = +++ = hk+1 = 0 and hy £ 0 if and only if (d — k + 1)P is 
the smallest integer dilate of P that contains an interior lattice point. 





Proof. Theorem 3.18 says that hy, is the highest nonzero coefficient if and only 
if Lp(—1) = Lp(—2) =--- = Lp (—(d—k)) = 0 and Lp (-(d—k + 1)) £0. 
Now use Ehrhart—Macdonald reciprocity (Theorem 4.1). 














The largest k for which hy # 0 is called the degree of P. The above 
theorem says that the degree of P is k precisely if (d —k+1)P is the smallest 
integer dilate of P that contains an interior lattice point. 


4.4 The Ehrhart Series of Reflexive Polytopes 


As an application of Theorem 4.4, we now study a special class of integral 
polytopes whose Ehrhart series have an additional symmetry structure. We 
call a polytope P that contains the origin in its interior reflexive if it is 
integral and has the half-space description 


P={xeR?: Ax<1}, 


where A is an integral matrix. (Here 1 denotes a vector all of whose co- 
ordinates are 1.) The following theorem gives a characterization of reflexive 
polytopes through their Ehrhart series. 


Theorem 4.6 (Hibi’s palindromic theorem). Suppose P is an integral 
d-polytope that contains the origin in its interior and that has the Ehrhart 
series 





haz? + ha-1 2t +--+ hız + ho 
Ehrp(z) = (1—z)41 $ 


Then P is reflexive if and only if hk = ha-p for all 0 < k < d, 


The two main ingredients for the proof of this result are Theorem 4.4 and 
the following: 


Lemma 4.7. Suppose a1,a2,...,ada,b € Z satisfy gcd (a1, a2,...,@aa,b) = 1 
and b > 1. Then there exist positive integers c and t such that tb < c < (t+1)b 
and {(m1;,m2,..., Mma) EZI: amı +azMmz +--+ adMa = c} £ Ø. 
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Proof. Let g = gcd (aj, a@2,...,@4); by our assumption, gcd(g, b) = 1, so one 
can find integers k and t such that 


kg —tb=1. (4.10) 


Furthermore, we can choose k and t in such a way that t > 0. Let c = kg; 
equation (4.10) and the condition b > 1 imply that tb < c < (t + 1)b. Finally, 
since g = gcd (a1, a2, ..., aq) there exist m,,mz2,...,Mq E Z such that 














amı +agm2+---+aqma=kg=c. 
Proof of Theorem 4.6. We recall that P is reflexive if and only if 
P={xe R’: Ax< 1} for some integral matrix A. (4.11) 
We claim that P has such a half-space description if and only if 
P° NZ? = {0} and for all t € Zso, (t+1)P°NZ2=tP NZ. (4.12) 


This condition means that the only lattice points that we gain when pass- 
ing from tP to (t + 1)P are those on the boundary of (t + 1)P. The fact 
that (4.11) implies (4.12) is the content of Exercise 4.11. Conversely, if P 
satisfies (4.12) then there are no lattice points between tH and (t+ 1)H for 
any facet hyperplane H of P (Exercise 4.12). That is, if a facet hyperplane 
is given by H = {x ER? : axı + aorg+---+agrg = bh, where we may as- 
sume gcd (a1, d2,...,@q,0) = 1, then 


{xe Zt: tb < arxı +azx2 +--+ +agxq < (t+ 1)d} =Ø. 


But then Lemma 4.7 implies that b = 1, and so P has a half-space description 
of the form (4.11). 

Thus we have established that P is reflexive if and only if it satisfies (4.12). 
Now by Theorem 4.4, 





= ho zee) + hy zt +--+ ha-12?+haz 
= (1-24 





Ehrpo (z) = (—1)%+! Ehrp (G) 


By condition (4.12), P is reflexive if and only if this rational function is equal 
to 


X Lplt- j= zX Lp(t) zt = z Ehrp (z) 


t>1 t>0 
= haz?! + ha-1 et +--+ hiz? +hoz 
E (1— z)¢+1 i 








that is, if and only if hy = ha_x for all O < k < d, 
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4.5 More “Reflections” on the Coin-Exchange Problem 
and the Gallery of Chapter 2 


We have already encountered special cases of Ehrhart-Macdonald reciprocity 
several times. Note that Theorem 4.1 allows us to conclude that counting 
the number of interior lattice points in a rational polytope is tantamount to 
counting lattice points in its closure. Exercises 1.31, 2.1, and 2.7, as well as 
part (b) of each theorem in the gallery of Chapter 2, confirm that 


Lp(—t) = (—1)9™? Lpo(t). 


Notes 


1. Ehrhart—Macdonald reciprocity (Theorem 4.1) had been conjectured (and 
proved in several special cases) by Eugène Ehrhart for about a decade before 
I. G. Macdonald found a general proof in 1971 [123]. One can actually relax the 
condition of Ehrhart—Macdonald reciprocity: it holds for polytopal complexes 
that are homeomorphic to a d-manifold. The proof we give here (including 
the proof of Theorem 4.3) appeared in [30]. 


2. Theorem 4.3 is due to Richard Stanley [169], who proved more general 
versions of this theorem. The reader might recall that the rational function 
representing the Ehrhart series of a rational cone can be thought of as its 
meromorphic continuation. Stanley reciprocity (Theorem 4.3) gives a func- 
tional identity for such meromorphic continuations. 


3. The term reflexive polytope was coined by Victor Batyrev, who found ex- 
citing applications of these polytopes to mirror symmetry in physical string 
theory [16]. Batyrev proved that the toric variety Xp defined by a reflexive 
polytope P is Fano, and that every generic hypersurface of Xp is Calabi- Yau. 
That the Ehrhart series of a reflexive polytope exhibits an unexpected sym- 
metry (Theorem 4.6) was discovered by Takayuki Hibi [97]. The number of 
reflexive polytopes in dimension d is known for d < 4 [117, 118]; for example, 
there are precisely 16 reflexive polytopes in dimension 2, up to symmetries 
(see also [165, Sequence A090045]). A striking result is that the sum of the 
numbers of lattice points on the boundaries of a reflexive polygon and its dual 
is always 12 [147]. A similar result holds in dimension 3 (with 12 replaced by 
24) [18], but no elementary proof of the latter fact is known [22, Section 4]. 


4. There is an equivalent definition for reflexive polytopes: P is reflexive 
if and only if both P and its dual P* are integral polytopes. The dual 
polytope of P (often also called the polar polytope) is defined as P* := 
{x €R¢:x-y<1forallye P}. The concept of (polar) duality is not con- 
fined to polytopes but can be defined for any nonempty subset of Rt. Duality 
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is a crucial chapter in the theory of polytopes, and one of its applications is 
the equivalence of the vertex and hyperplane description of a polytope. For 
more about (polar) duality, the reader might consult [12, Chapter IV]. 


5. The cross-polytopes © from Section 2.5 form a special class of reflexive 
polytopes. We mentioned in the Notes of Chapter 2 that the roots of the 
Ehrhart polynomials Lo all have real part —3 (51, 109]. Christian Bey, Martin 
Henk, and Jörg Wills proved in [37] that if all roots of Lp, for some integral 
polytope P, have real part —ż, then P is the unimodular image of a reflexive 
polytope. 


Exercises 





t—1 
4.1. & Prove (4.2): For a € Zo, Lo,1/a)(t) = | i 
' a 


4.2. & Explain (4.4): If wi, W2,..., Wa € R? are linear independent and 


TI = {Aywi + Agwe +--+ + Agwa: 0 < Aq, A2,---,Aq < 1}, 








then v + Il = —(-v + TI) + wi + wo +- + Wa. 


4.3. & Prove that (4.5)-(4.7) imply (4.8); that is, if X is a rational pointed 
d-cone K with the origin as apex, and v € Rf is such that 


K° AZ? = (v +K) NZ$, 
Av+K;)NZ =o for all j =1,...,m, 
and 
ð (-v +K) NZ! =ø for all j = 1,...,m, 
then 
KAZ? = (-v+ K)nZ?*. 
4.4. Prove the following generalization of Theorem 4.3 to rational pointed 
cones with arbitrary apex: Suppose XK is a rational pointed d-cone K with the 


origin as apex, and v € R¢. Then the integer-point transform oy+(z) of the 
pointed d-cone v + K is a rational function that satisfies 


oer (F) = CDon 0). 


4.5. & Suppose Q: Z — C is a quasipolynomial. We know that R (2) = 
so Q(t) 2 evaluates to a rational function. 
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(a) Prove that Ro(z) := deo Q(t) z also evaluates to a rational function. 

(b) Let Q(t) = 1. Prove that as rational functions, RG(2) + Ro(z) = 0. 

(c) Suppose Q is a polynomial. Prove that as rational functions, RG(2) + 
Ro(z) = 0. 

(d) Suppose Q is a quasipolynomial. Prove that as rational functions, RÈ (z)+ 
Ro(z) = 0. 





4.6. & Suppose that P is a rational d-polytope for which 
Lpo (t) = Lp(t ea k) and Lpo (1) = Lpo (2) =.. = Lpo (k = 1) =0 


for some integer k. (This situation applies to some of the polytopes in the 
gallery of Chapter 2.) Prove that 


zZ 


Ehrp C) = (—1)°+! 2" Ehrp (z). 


4.7. Suppose P is an integral d-polytope with Ehrhart series 





haz? + ha-12 t+- +hiz+1 
Ehrp(z) = (aan ; 


Prove that ha = Lpo (1). 


4.8. Suppose P is a convex integral d-polytope. Show that the dilate (d+ 1)P 
contains an interior lattice point. 


4.9. Suppose P is a convex integral polytope. Denote the boundary of P by 
OP. Prove that Lap(t) is a polynomial that is either even or odd. Determine 
its constant term. 


4.10. Recall the restricted partition function 
P{a1,09,...,an} (E) >= # {(ma, Ma) € ZŠo : M101 +: + Maad = t} 


from Chapter 1. Prove that as quasipolynomials, 





P{a1,a2,..., asl t ai a2 eas an) = (1 t Piara ee) 
and that 


P{ai,a2 ihe an} (71) oa P{ar,a2 nan} (72) SORT 
= P{a1,a2 BO Goel — ag —-+++— Gn + 1) =0. 
4.11. & Prove that (4.11) implies (4.12), that is, show that if the polytope P 
is given by P = {x € R’: Ax <1} for an integral matrix A, then P? NZ’ = 
{0} and for all t € Zyo, (t+ 1)P°NZ4 = tPA Z’. 


4.12. & Suppose P is an integral polytope that satisfies (4.12): P° MZ? = {0} 
and for all t € Zso, (t + 1)P° N Z4 = tP N Z°. Then for any t € Z, there are 
no lattice points between tH and (t +1)H for any facet hyperplane H of P. 
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Open Problems 


4.13. Prove the following conjecture of Batyrev [17]: For an integer polytope 
P of degree j, the volume of P is bounded by a constant that depends only 
on hj. 


4.14. Suppose P is a 3-dimensional reflexive polytope. Denote by e* the edge 
in the dual polytope P* that corresponds to the edge e in P. Give an elemen- 
tary proof that 
5 length (e) - length (e*) = 24. 
e edge of P 


4.15. Find the number of reflexive polytopes in dimension d > 5. 
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Face Numbers and the Dehn—Sommerville 
Relations in Ehrhartian Terms 


“Data! Data! Data!” he cried, impatiently. “I can’t make bricks without clay.” 


Sherlock Holmes (“The Adventure of the Copper Beeches,” by Arthur Conan Doyle, 
1859-1930) 


Our goal in this chapter is twofold, or rather, there is one goal in two different 
guises. The first one is to prove a set of fascinating identities, which give 
linear relations among the face numbers fp. They are called Dehn—Sommerville 
relations, in honor of their discoverers Max Wilhelm Dehn (1878-1952)! and 
Duncan MacLaren Young Sommerville (1879-1934).? Our second goal is to 
unify the Dehn-Sommerville relations (Theorem 5.1 below) with Ehrhart- 
Macdonald reciprocity (Theorem 4.1). 


5.1 Face It! 


We denote the number of k-dimensional faces of P by the symbol fp. As k 
varies from 0 to d, the face numbers fp encode intrinsic information about 
the polytope P. The d-polytope P is simple if each vertex of P lies on 
precisely d edges of P. 


Theorem 5.1 (Dehn—Sommerville relations). If P is a simple d-polytope 
and 0 < k< d, then 


1 For more information about Dehn, see 
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Dehn.html. 

2 For more information about Sommerville, see 
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Sommerville.html. 
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This theorem takes on a particularly nice form for k = d, namely the 
famous Euler relation, which holds for any polytope (not just simple ones). 


Theorem 5.2 (Euler relation). If P is a convex d-polytope, then 


d 
do (-1)' fy = 1. 


j=0 


This identity is less trivial than it might look. We give a quick proof 
for rational polytopes, for which we can use Ehrhart—Macdonald reciprocity 
(Theorem 4.1). 


Proof of Theorem 5.2, assuming P is rational. Let us count the integer points 
in tP according to the (relatively) open faces that contain them:? 


Lp(t) = X` Lrt) = X (CDF Le(-t). 


FCP FCP 


Here, and in the remainder of this chapter, the sums are over all nonempty 
faces. (Alternatively, we could agree that Lg(t) = 0.) The constant term of 
L(t) is 1, for any face F (by Exercise 3.27). Hence the constant terms of the 
identity above give 


d 


1= OED =P h, 


FCP j=0 











which proves our claim. 





There is a natural structure on the faces of a polytope P induced by the 
containment relation F C G. This relation gives a partial ordering on the set 
of all faces of P, called the face lattice of P.4 A useful way to illustrate 
this partially ordered set is through a graph whose nodes correspond to the 
faces of P, such that two nodes are adjacent if one of their corresponding 
faces contains the other. In Figure 5.1, we give the face lattice for a triangle. 
Exercise 2.6 implies that the face lattice of any simplex is a Boolean lattice, 
which is the partially ordered set formed by all subsets of a finite set, where 
the partial ordering is again subset containment. 

We already mentioned that we will unify the Dehn—Sommerville relations 
(Theorem 5.1) with Ehrhart—-Macdonald reciprocity (Theorem 4.1). It is for 
this reason that we will prove Theorem 5.1 only for rational polytopes. To 
combine the notions of face numbers and lattice-point enumeration, we define 


3 Note that the relative interior of a vertex is the vertex itself. 
4 The usage of the word lattice here is disjoint from our previous definition of the 
word. 
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A 
V3 
E2 E3 Ey 
: be 
Vi E; V1 V3 
v2 
Ø 


Fig. 5.1. The face lattice of a triangle. 


Rt)= X Lz), 
FCP 
dim F=k 


the sum being taken over all k-faces of P. By Ehrhart’s theorem (Theo- 
rem 3.23), F, is a quasipolynomial. Since L¢(0) = 1 for all F, 


Fi,(0) = fr, 


the number of k-faces of P. We also remark that the leading coefficient of Fi, 
measures the relative volume of the k-skeleton of P, that is, the union of all 
k-faces; see Section 5.4 for a precise definition of relative volume. 

Our common extension of Theorems 5.1 and 4.1 is the subject of the next 
section. 


5.2 Dehn—Sommerville Extended 


Theorem 5.3. If P is a simple rational d-polytope and O < k < d, then 


= DEN Ce) F(t). 


j=0 


The classical Dehn—Sommerville equations (Theorem 5.1)—again, only for 
rational polytopes—are obtained from the constant terms of the counting 
functions on both sides of the identity. On the other hand, for k = d, Theo- 
rem 5.3 gives (with t replaced by —t) 


d 
Lp(—t) = Fa(—t) = $0 (-1) y(t) = (-1)7 0 (-1)* FC 


j=0 j=0 


The sum on the right-hand side is an inclusion—-exclusion formula for the 
number of lattice points in the interior of tP (count all the points in P, 
subtract the ones on the facets, add back what you’ve overcounted, etc.), so 
in a sense we recover Ehrhart-Macdonald reciprocity. 
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Proof. Suppose F is a k-face of P. Then, again by counting the integer points 
in F according to relatively open faces of F, 


L(t) = Ñ Leo(t), 


GCF 


or, by the Ehrhart-Macdonald reciprocity (Theorem 4.1), 


k 
Lrt) = $ (1) te(-t) =X -1 $, Ie(—t). (81) 
GCF j=0 GCF 
dim G=j 
Now sum both left- and right-hand sides over all k-faces and rearrange the 
sum on the right-hand side: 


CP GCF 


F 
dim F=k dim G=j 


k 
=X (-1} XO fe(P/9)Lo(-t) 
j=0 SSE 


k 7 
Doy E (Ga Ate 
g=0 GCP 
k 
> 


dim G=j 


1p (g RD. 


Here fk(P/G) denotes the number of k-faces of P containing a given j-face G 


of P. Since P is simple, this number equals (eS) (see Exercise 5.4). 














5.3 Applications to the Coefficients of an Ehrhart 
Polynomial 


We will now apply Theorem 5.3 to the computation of the Ehrhart polynomial 
of an integral d-polytope P. The only face-lattice-point enumerator involving 
the face P is Fy(t), for which Theorem 5.3 specializes to 


d 
Lp(t) = Falt) = X (- 17 F; (—t). 


j=0 
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In fact, we do not have to assume that P is simple, since this identity simply 
counts integer points by faces. (Recall that (—1)’F;(—t) counts the integer 
points in the ¢-dilates of the interior of the j-faces.)° The last term on the 
right-hand side is 


(—1)"Fa(—t) = (-1)*L p(t) = Lpo (t) 





by Ehrhart—Macdonald reciprocity. Shifting this term to the left gives 


a 
= 


Lp(t) — Lpo (t) = ŅO(-1} F;(—t). (5.2) 
0 


j 
The difference on the left-hand side of this identity has a natural interpre- 
tation: it counts the integer points on the boundary of tP. (And in fact, the 
right-hand side is once more an inclusion-exclusion formula for this num- 
ber.) Let us write Lp(t) = cat? + ca-1t™! +- + co. Then Lpo(t) = 
cat? — ca—1 t? +--+ + (-1)4c0, so that 


Lp(t) — Lpo (t) = 2cq-1 pets 2ca—3 te-3 4... ; 


where this sum ends with 2co if d is odd and 2c,t if d is even (this should 
look familiar; see Exercise 4.9). Combining this expression with (5.2) yields 
the following useful result. 


Theorem 5.4. Suppose Lp(t) = cat? + cg_1t* 1 +- + co is the Ehrhart 
polynomial of P. Then 


a 
en 


m l 
ca-1 tt?) heegt? Hm 5 (=1¥ F;(—t). 
0 

















a. 
Il 


We can make the statement of this theorem more precise (but also more 
messy) by writing 


F;(t) = 5 Lr(t) = Cjj ti + Cjj—1t? + +--+ +Cj,0- 
ay 
im F=j 


Then collecting the coefficients of t* in Theorem 5.4 yields the following rela- 
tions. 
Corollary 5.5. If k and d are of different parity, then 


d—1 


1 l 
ck = 5 9 (I ej. 


j=0 














5 So one might argue that we did not need the Dehn-Sommerville machinery for 
the computations in the current section. This argument is correct, although The- 
orem 5.3 is a strong motivation. 
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If k and d have the same parity, then the left-hand side has to be replaced 
by 0. 

The first coefficient c, in the Ehrhart polynomial of a d-polytope P satis- 
fying the parity condition is cg_;. In this case Corollary 5.5 tells us that cq_ 
equals 5 times the sum of the leading terms of the Ehrhart polynomials of 
the facets of P. 

The next interesting coefficient is cg_3. For example, if dim P = 4, we can 
use Corollary 5.5 to compute cı entirely from (the linear coefficients of) the 
Ehrhart polynomials of the faces of dimension < 3. 


5.4 Relative Volume 


It’s time to return to continuous volume. Recall Lemma 3.19: if S C Rt is 
d-dimensional, then vol S = limz.. a -# (tS N Z4), Back in Chapter 3, we 
stressed the importance of S being d-dimensional, because otherwise (i.e., S 
is lower-dimensional although living in d-space), by our definition vol S = 0. 
However, the case that S C R? is not of dimension d is often very in- 
teresting; an example is the polytope P that we encountered in connec- 
tion with the coin-exchange problem in Chapter 1. We still would like to 
compute the volume of such objects, in the relative sense. This makes for 
a slight complication. Let’s say S C R? is of dimension m < d, and let 
span S = {x + Aly — x): x,y € S, ÀA € R}, the affine span of S. If we follow 
the same procedure as above (counting boxes or grid points), we compute 
the volume relative to the sublattice (span S) N Zf; we call this the relative 
volume of S. 

For example, the line segment L from (0,0) to (4,2) in R? has the relative 
volume 2, because in span L = { (x,y) ER?: y= z/2}, L is covered by two 
segments of “unit length” in this affine subspace, as pictured in Figure 5.2. A 
three-dimensional instance that should be reminiscent of Chapter 1 is illus- 
trated in Figure 5.3. 





Fig. 5.2. The line segment from (0,0) to (4,2) and its affine sublattice. 


5.4 Relative Volume 101 











Fig. 5.3. The triangle defined by $+ $ + å = 1,<% > 0,y >0,z > 0. The shaded 
region is a fundamental domain for the sublattice that lies on the affine span of the 
triangle. 


If S C R? has full dimension d, the relative volume coincides with the 
“full-dimensional” volume. Henceforth, when we write vol S we refer to the 
relative volume of S. With this convention we can rewrite Lemma 3.19 to 
accommodate a set S C R? that is m-dimensional: its relative volume can be 
computed as 


ode il d 
vol S = lim => # (SOZ ). 

In the case that # (tS N Z*) has the special form of a polynomial—for 
example, if S is an integral polytope—we can further simplify this theorem. 
Suppose P C Rt is an integral m-polytope with Ehrhart polynomial 

Lp(t) = cmt” +cem1t + Hea 1, 


Then, according to the above discussion, and much in sync with Lemma 3.19, 
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Cirie 


T ta E A a ere 
t= t™ t—oo tm 
The relative volume of P is the leading term of the corresponding counting 
function Lp. 

For example, in the previous section we found that Corollary 5.5 implies 
that the second leading coefficient cg_; of the Ehrhart polynomial of the 
d-polytope P equals l times the sum of the leading terms of the Ehrhart 
polynomials of the facets of P. The leading term for one facet is simply the 
relative volume of that facet: 


Theorem 5.6. Suppose Lp(t) = cat? + ca-1 t! +- + co is the Ehrhart 
polynomial of the integral polytope P. Then 


1 
Cd-1 = 5 ‘2 vol F. 


F a facet of P 














Notes 


1. The Dehn-Sommerville relations (Theorem 5.1) first surfaced in the work 
of Max Dehn, who proved them in 1905 for dimension five [71]. (The Dehn- 
Sommerville relations are not that complicated for d < 4; see Exercise 5.3.) 
Some two decades later, D. M. Y. Sommerville proved the general case [167]. 
Theorem 5.1 was neither well known nor much used in the first half of the 
twentieth century, but only after its rediscovery by Victor Klee [111] and its 
appearance in Branko Griinbaum’s famous and widely read book [90]. 


2. The Euler relation (Theorem 5.2) is easy to prove directly for d = 3 (this 
case is attributed to Euler), but for higher dimension, one has to be somewhat 
careful, as we already remarked in the text. The classical proof for general d 
was found in 1852 by Ludwig Schlafli [158], although it (like numerous later 
proofs) assumes that the boundary of a convex polytope can be built up 
inductively in a “good” way. This nontrivial fact—which is called shelling of 
a polytope—was proved by Heinz Bruggesser and Peter Mani in 1971 [50]. 
Shellability is nicely discussed in [193, Lecture 8]. There are short proofs of 
the Euler relation that do not use the shelling of a polytope (see, for example, 
[120, 139, 184]). 


3. The reader might suspect that proving Theorems 5.1 and 5.2 for rational 
polytopes suffices for the general case, since it seems that we can transform 
a polytope with irrational vertices slightly to one with only rational vertices 
without changing the face structure of the polytope. This is true in our every- 
day world but fails in dimension > 4 (see [156] for dimension 4 and [193, 
pp. 172/173] for general dimension). 


Exercises 103 


4, Theorem 5.3 is due to Peter McMullen [127], who, in fact, proved this result 
in somewhat greater generality. Another generalization of Theorem 5.3 can be 
found in [59]. 


Exercises 


5.1. Consider a simple 3-polytope with at least five facets. Two players play 
the following game: Each player, in turn, signs his or her name on a previously 
unsigned face. The winner is the player who first succeeds in signing three 
facets that share a common vertex. Show that the player who signs first will 
always win by playing as well as possible.® 


5.2. Show that for the d-cube, fy = 2¢-*(?). 


5.3. Give an elementary proof of the Dehn—Sommerville relations (Theo- 
rem 5.1) for d < 4. 


5.4. & Let P be a simple d-polytope. Prove that the number of k-faces of P 
containing a given j-face of P equals (477). 

5.5. & Show directly, without using Theorem 5.2, that for a d-simplex: 

(a) fe = ($41). 


d 
(b) S0(-1)* fa = 1. 


k=0 





5.6. Prove Theorem 5.1 directly (and hence not requiring P to be an integral 
polytope). (Hint: Orient yourself along the proof of Theorem 5.3, but start 
with the Euler relation (Theorem 5.2) for a given face F instead of (5.1).) 


5.7. Let F be a face of a simple polytope P. Prove that 

di ; 
y (peme s] = (-1)° Gay ba Oy. ial 
GDF E 


5.8. Show that the equations in Theorem 5.3 are equivalent to the following 
identities. If P is a simple lattice d-polytope and k < d, then 


Don (i i) rn = Dn i) Ro. 


j=0 i=k 


6 This was one of the 2002 Putnam contest problems. 
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5.9. Prove that the equations in the previous exercise imply the following 
identities which compare the number of lattice points in faces and relative 
interiors of faces of the simple polytope P: 


revi) > #@nz!) =D (i) E #@ nz"), 


j=0 J FCP GCP 
dim F=d—-j dim G=i 
where k = 0,...,d = dim P. For example, for k = 0, we have 
#(PNZ*) =X #(G9°NZ4), 
GOP 


and for k = d we obtain the inclusion—exclusion formula 


d 


#(P°NZ*)=S\(-1)*4 SO #(FNZ?4). 
j=0 afer 


5.10. Another nice reformulation of Theorem 5.3 is the following generalized 
reciprocity law. For an integral d-polytope P, define the generalized Ehrhart 
polynomial 


y= Ne 5) S Lrt), k=0,...,4 


jg=0 FCP 
dim F=d—j 


Prove the generalized reciprocity law 
E,(—t) = (—1)? Ea-x(t), k=0,...,d, 
which implies Ehrhart—Macdonald reciprocity (Theorem 4.1) when k = 0. 


5.11. What happens when P is not simple? Give an example for which The- 
orem 5.3 fails. 


5.12. Give an alternative proof of Theorem 5.6 by considering Lp(t) — Lpo (t) 
as the lattice-point enumerator of the boundary of P. 


6 





Magic Squares 


The peculiar interest of magic squares and all lusus numerorum in general lies in 
the fact that they possess the charm of mystery. 


W. S. Andrews 





Fig. 6.1. Magic square at the Temple de la Sagrada Família (Barcelona, Spain). 


Equipped with a solid base of theoretical results, we are now ready to return 
to computations. We use Ehrhart theory to assist us in enumerating magic 
squares. 

Loosely speaking, a magic square is an n x n array of integers (usually 
required to be positive, often restricted to the numbers 1,2,...,n?, usually 
required to have distinct entries) whose sum along any row, column, and main 
diagonal is the same number, called the magic sum. Magic squares have turned 
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up time and again, some in mathematical contexts, others in philosophical or 
religious contexts. According to legend, the first magic square (the ancient Luo 
Shu square) was discovered in China sometime before the first century B.C.E. 
on the back of a turtle emerging from a river. It was the square pictured in 
Figure 6.2. 











4) 9] 2 
3°| 5 | 7 
8 | 1 | 6 

















Fig. 6.2. The Luo Shu square. 


Our task in this chapter is to develop a theory for counting certain classes 
of magic squares, which we now introduce. 


6.1 It’s a Kind of Magic 


One should notice that the Luo Shu square has the distinct entries 1,2,...,9, 
so these entries are positive, distinct integers drawn from a particular set. Such 
requirements are too restrictive for our purposes. We define a semimagic 
square to be a square matrix whose entries are nonnegative integers and 
whose rows and columns (called lines in this setting) sum to the same number. 
A magic square is a semimagic square whose main diagonals also add up to 
the line sum. Figure 6.3 shows two examples. 




















3°} 0 | 0 1 2 | 0 
0 1 2 0 1 2 
O | 2 1 2 | 0 T 
































Fig. 6.3. A semimagic and a magic square. 


We caution the reader about clashing definitions in the literature. For 
example, some people reserve the term “magic square” for what we will call 
a traditional magic square, a magic square of order n whose entries are 
the distinct integers 1,2,...,n?. (The Luo Shu square is an example of a 
traditional magic square.) Others are slightly less restrictive and use the term 
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“magic square” for a magic square with distinct entries. We stress that we do 
not make this requirement in this chapter. 

Our goal is to count semimagic and magic squares. In the traditional case, 
this is in some sense not very interesting:' for each order there is a fixed 
number of traditional magic squares. For example, there are 7040 traditional 
4 x 4 magic squares. 

The situation becomes more interesting if we drop the condition of tra- 
ditionality and study the number of magic squares as a function of the line 
sum. We denote the total numbers of semimagic and magic squares of order 
n and line sum t by H,,(t) and M,,(t), respectively. 














t t 
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Fig. 6.4. Semimagic and magic squares for n = 2. 


Example 6.1. We illustrate these notions for the case n = 2, which is not 
very complicated. Here a semimagic square is determined once we know one 
entry, say the upper left one, denoted by Ọ in Figure 6.4. Because of the upper 
row sum, the upper right entry has to be t — Y, as does the lower left entry 
(because of the left column sum). But then the lower right entry has to be 
t — (t — Q) = Q (for two reasons: the lower row sum and the right column 
sum). The entry Q can be any integer between 0 and t. Since there are t + 1 
such integers, 

Aa(t) =t4+1. (6.1) 


In the magic case, we also have to think of the diagonals. Looking back at our 
semimagic square in Figure 6.4, the first diagonal gives 2-0 = t, or 9 = t/2. 
In this case, t — Y = t/2, and so a 2 x 2 magic square has to have identical 
entries in each position. Because we require the entries to be integers, this is 
possible only if t is even, in which case we obtain precisely one solution, the 
square on the right in Figure 6.4. That is, 


1 if t is even, 
MO ‘i if t is odd. 


These easy results already hint at something: the counting function Hp is of 
a different character than the function Mp. 














1 It is, nevertheless, an incredibly hard problem to count all traditional magic 
squares of a given size n. At present, these numbers are known only for n < 5 
[165, Sequence A006052]. 


108 6 Magic Squares 


6.2 Semimagic Squares: Integer Points 
in the Birkhoff-von Neumann Polytope 


Just as the Frobenius problem was intrinsically connected to questions about 
integer points on line segments, triangles, and higher-dimensional simplices, 
magic squares and their relatives have a life in the world of geometry. The 
most famous example is connected to semimagic squares. 

A semimagic n x n square has n? nonnegative entries that sum to the same 
number along any row or column. Consider therefore the polytope 


Til tt Lin 
2 
Br:= : : ER” : aj, >0 


Eni --. Tnn 


Xj tjk =l1forall<k<n 
Xp je = 1 forall l<j<nf’ 


(6.2) 
consisting of nonnegative real matrices, in which all rows and columns sum 
to one. Bn is called the n** Birkhoff-von Neumann polytope, in honor 
of Garrett Birkhoff (1911-1996)? and John von Neumann (1903-1957). Be- 
cause the matrices contained in the Birkhoff-von Neumann polytope appear 
frequently in probability and statistics (the line sum 1 representing probabil- 
ity 1), B, is often described as the set of all n x n doubly stochastic matrices. 


Geometrically, Bn is a subset of R”? and as such difficult to picture once 
n exceeds 1.4 However, we can get a glimpse of Bz C R* when we think about 
what form points in By can possibly attain. Very much in sync with Figure 6.4, 
such a point is determined by its upper left entry Q, pictured in Figure 6.5. 
This entry Q is a real number between 0 and 1, which suggests that Bj should 


Q 1-9 
1-9 Q 
Fig. 6.5. A point in Bo. 


look like a line segment in 4-space. Indeed, the vertices of Bz should be given 
by 9 = 0 and Q = 1, that is, by the points 


10\ (01 
(01) Ga) e® 


? For more information about Birkhoff, see 

http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Birkhoff Garrett.html. 
3 For more information about von Neumann, see 

http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Von_Neumann.html. 
4 The case n = 1 is not terribly interesting: Bı = {1} is a point. 
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These results generalize: B,, is an (n — 1)?-polytope (see Exercise 6.3), whose 
vertices (Exercise 6.5) are the permutation matrices, namely, those n x n 
matrices that have precisely one 1 in each row and column (every other entry 
being zero). For dimensional reasons, we can talk about the continuous volume 
of Bn only in the relative sense, following the definition of Section 5.4. 

The connection of the semimagic counting function H,,(t) to the Birkhoff- 
von Neumann polytope Bn becomes clear in the light of the lattice-point enu- 
merator for Bn: the counting function Hn (t) enumerates precisely the integer 
points in tB,, that is, 


H,(t) =# (18, n z’) = Lg, (t). 


We can say more after noticing that permutation vertices are integer points 
in Bn, and so Ehrhart’s theorem (Theorem 3.8) applies: 











Theorem 6.2. H„(t) is a polynomial in t of degree (n — 1)?. 





The fact that H, is a polynomial—apart from being mathematically 
appealing—has the same nice computational consequence that we exploited 
in Section 3.6: we can calculate this counting function by interpolation. For 
example, to compute Ho, a linear polynomial, we need to know only two val- 
ues. In fact, since we know that the constant term of Hə is 1 (by Corollary 
3.15), we need only one value. It is not hard to convince even a lay person 
that H2(1) = 2 (which two semimagic squares are those?), from which we 
interpolate 

H(t)=t+1. 


To interpolate the polynomial H3, we need to know 4 values aside from 
H3(0) = 1. In fact, we do not even have to know that many values, because 
Ehrhart—Macdonald reciprocity (Theorem 4.1) assists us in computations. To 
see this, let H?(t) denote the number of n x n square matrices with positive 
integer entries summing up to t along each row and column. A moment’s 
thought (Exercise 6.6) reveals that 


H°(t) = Halt- n). (6.3) 


But there is a second relationship between H,, and H$, namely, Hẹ (t) counts, 
by definition, the integer points in the relative interior of the Birkhoff—von 
Neumann polytope Bn, that is, H(t) = Lge (t). Now Ehrhart—Macdonald 
reciprocity (Theorem 4.1) gives 


He (-t) = (-1)-" H, (t). 


Combining this identity with (6.3) gives us a symmetry identity for the count- 
ing function for semimagic squares: 
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Theorem 6.3. The polynomial H, satisfies 
H,(—n—t) = (-1)°-)" Hp (t) 


and 





H,,(—1) = H,(—2) = -= H,(—n +1) =0. 











The roots of H, at the first n — 1 negative integers follow from (Exer- 
cise 6.7) 
H9(1) = H2(2) = = H2(n—1) =0. 


Theorem 6.3 gives the degree of Bn, and it implies that the numerator of the 
Ehrhart series of the Birkhoff-von Neumann polytope is palindromic: 


Corollary 6.4. The Ehrhart series of the Birkhoff-von Neumann polytope Bn 
has the form 





) = R(n—1)(n—2) z(n=1)(n=2) p... ho 


Ehrg,, (z a yea 


? 


where ho, hi,.. <, OA BY € Z>0 satisfy hy = R(n—1)(n—2)—k for0O<k< 
(n—1)(n—2) 
2 


Proof. Denote the Ehrhart series of B,, by 


hin-1)2 2079 +-+- + ho 
(1 = yy ee 





Ehrz,, (z) = 


The fact that h—12 = +++ = A~@-—1)2~-(n-2) = 0 follows from the second 
part of Theorem 6.3 and Theorem 4.5. The palindromy of the numerator 
coefficients follows from the first part of Theorem 6.3 and Exercise 4.6: it 
implies 


1 
Ehrg,, C) = Soe Ehrg, (z), 
z 


which yields hy = h(a—1)(n—2)—k upon simplifying both sides of the equation. 














Let’s return to the interpolation of H3: Theorem 6.3 gives, in addition to 
H3(0) = 1, the values 


H3(-3)=1 and  4Hs3(—1) = H3(—2) =0. 
These four values together with H3(1) = 6 (see Exercise 6.1) suffice to inter- 
polate the quartic polynomial H3, and one computes 


Ag, espe AO see 0 
ae t+1. A 
3° J 8 TG + (6 ) 





H(t) = 
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This interpolation example suggests the use of a computer; we let it calcu- 
late enough values of H, and then simply interpolate. As far as computations 
are concerned, however, we should not get too excited about the fact that 
we computed H> and Hs so effortlessly. In general, the polynomial H, has 
degree (n — 1)?, so we need to compute (n — 1)? + 1 values of H, to be 
able to interpolate. Of those, we know n (the constant term and the roots 
given by Theorem 6.3), so n? — 3n + 1 values of H, remain to be computed. 
Ehrhart—Macdonald reciprocity reduces the number of values to be computed 
to ea +1. That’s still a lot, as anyone can testify who has tried to get a 
computer to enumerate all semimagic 7 x 7 squares with line sum 15. Never- 
theless, it is a fun fact that we can compute H,, for small n by interpolation. 
It is amusing to test one’s computer against the constant-term computation 
we will outline below, and we invite the reader to try both. For small n, in- 
terpolation is clearly superior to a constant-term computation in the spirit 
of Chapter 1. The turning point seems to be right around n = 5: the com- 
puter needs more and more time to compute the values H,,(t) as t increases. 
Methods superior to interpolation are needed. 


6.3 Magic Generating Functions and Constant-Term 
Identities 


Now we will construct a generating function for Hp, for which we will use 
Theorem 2.13. The semimagic counting function Hp is the Ehrhart polynomial 
of the nt? Birkhoff-von Neumann polytope Bn, which, in turn, is defined as a 
set of matrices by (6.2). First we rewrite the definition of Bn to fit the general 
description (2.23) of a polytope. If we consider the points in B, as column 
vectors in R”? (rather than as matrices in R"*”) then 


B, = {x € RS : Ax=b}, 


where 
Lee] 
Teel 
A= Lora ez (6.5) 
1 1 1 
1 1 1 
(here we omit the zero entries) and 
1 
1 
b= ez”, 
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From this description of 6,, we can easily build the generating function 
for H,. According to Theorem 2.13, for a general rational polytope P = 
{x € RZo > Ax= b}, we have 


1 
(1-2) (1 oe} , 


where c1, C2, ..., Cq denote the columns of A. In our special case, the columns 
of A are of a simple form: they contain exactly two 1’s and elsewhere 0’s. We 
need one generating-function variable for each row of A. To keep things as 
clear as possible, we use 21, 22,...,2n for the first n rows of A (representing 
the row constraints of Bn) and w1, w2,...,Wn for the last n rows of A (rep- 
resenting the column constraints of Bn). With this notation, Theorem 2.13 
applied to 6, gives the following starting point for our computations: 





Lp(t) = const ( 


Theorem 6.5. The number H,,(t) of semimagic n x n squares with line sum 
t satisfies 


1 
t 
Thepeea (1 — z; wx) cies Zj icken wr) 


This identity is of both theoretical and practical use. One can use it to compute 
H3 and even H, by hand. For now, we work on refining it further, exemplified 
by the case n = 2. 

We first note that in the formula for Hə, the variables wı and wə are 
separated, in the sense that we can write this formula as a product of two 
factors, one involving only w and the other involving only we: 


Ha) = const (+ : | 


ztz$ (1— z1w1) (1 — z2w1) w$ (1 — z1w2) (1 — z2w2) w$ 

















Hn (t) = const 





Now we put an ordering on the constant-term computation: let us first com- 
pute the constant term with respect to w2, then the one with respect to w. 
(We don’t order the computation with respect to zı and zg just yet.) Since 21, 
z2, and w; are considered constants when we do constant-term computations 
with respect to w2, we can simplify: 





1 1 
Hə(t) = tarza l => tw 
a(t) = consta,,2, (a Se (z — 21w1) (1 — z2w1) wt 


X consti, (z — ZW) a — z2w2) z) 


Now we can see the effect of the separate appearance of wı and w2: the 
constant-term identity factors. This is very similar to the factoring that can 
appear in computations of integrals in several variables. Let us rewrite our 
identity to emphasize the factoring: 





6.3 Magic Generating Functions and Constant-Term Identities 113 





1 1 
H(t) = const -rz const 
2(t) NSbz 22 (z reas (aa) 


1 
tip l 
X CONStws (sds) 


But now the expressions in the last two sets of parentheses are identical, 
except that in one case the constant-term variable is called w and in the 
other case w2. Since these are just “dummy” variables, we can call them w, 
and combine: 


H(t) t ea t : : 
= Const, z consty g 
a PIED Nogt gt (1 — zw) (1 — z2w) wt 


(Note the square!) Naturally, all of this factoring works in the general case, 
and we invite the reader to prove it (Exercise 6.8): 


1 n 
(6.6) 
We can go further, namely, we can compute the innermost constant term 


1 
(1— zw) (1 — znwjwt ` 














constw 


It should come as no surprise that we’ll use a partial fraction expansion to do 
so. The w-poles of the rational function are at w = 1/21, w = 1/z2, ..., W = 
1/zn, and so we expand 





1 Aj Ao An d Bk 
= a] H . (6.7 
(1— zw): (1 — znw)wt gi gea w— + Do ih oe 


Just as in Chapter 1, we can forget about the B,-terms, since they do not 
contribute to the constant term, that is, 

















1 
tw 
a (1— z1w)---(1— znw)wt 
A Aa An 
= const» eck Pye PSs aces 
zı Z2 Za 
= Åızı Apz EFs Aaii . 
We invite the reader to show (Exercise 6.9) that 
t-1 
Ag = d 
eee ee) 
Zk Zk Zk Zk 


t+n—-2 
2k 


Iha- z) 
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Putting these coefficients back into the partial fraction expansion yields the 
following identity. 


Theorem 6.6. The number H,,(t) of semimagic n x n squares with line sum 


t satisfies 
n haa ý 
Hn (t) = const (e "++ Zn) (>: ras) ' 


Amidst all this generality, we almost forgot to compute Hə with our partial 
fraction approach. The last theorem says that 




















zt+1 ARE NA 
H(t) = const | (z122) * ( 1 H2 ) 
217 Z2 Z22721 
t+2_—t —t ,t+2 
= const ( aa goo ea zF wee z) ; (6.9) 
(z1 — 22) (z1 — 22) (z1 — 22) 


Now it’s time to put more order on the constant-term computation. Let’s say 
we first compute the constant term with respect to z1, and after that with 
respect to z2. So we have to compute first 


t+2,-t —t ,t+2 
zi Z 212 2, 2 
const z, (25) , const,, (5) , and const, (E) : 
(z1 — 22) (z1 — 22) (z1 — 22) 
To obtain these constant terms, we need to expand the function G: As 


we know from calculus, this expansion depends on the order of the magnitudes 
of zı and zg. For example, if |z1| < |z2| then 


1 Len 1 z\* ae 
z =z 2 B-1 => (2) ~ Du eH At 











22 k>0 k>0 “2 
and hence 
1 d ( 1 ) ye k py k+l , 
= => vA — ae ae a 
1 5 1 
(z1 — 22)? dz \z1— 22 ET oo = re 


So let’s assume for the moment that |z1| < |z2|. This might sound funny, since 
zı and zg are variables. However, as such they are simply tools that enable 
us to compute some quantity that is independent of zı and z2. In view of 
these ideas, we may assume any order of the magnitudes of the variables. In 
Exercise 6.11 we will check that indeed the order does not matter. Now, 
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Pg (etc bs ns ee 


k+1 
= z3 * const; | 24°" 5 E ze (6.10) 
k>0 72 





z k+1 
= z3 * const z X meaty 


k>0 72 





=0, 


since there are only positive powers of zı (remember that t > 0). Analogously 
(see Exercise 6.10) one checks that 


constz, (3) =0. (6.11) 


21 — 22 


For the last constant term, we compute 


—t _t+2 
D -5 k+1 
const, i “2 _ | = t? const , gt ——— zh 
1 2 2 i 1 k+2 ^1 
(z1 — 22) k>0 *2 


+ k+1 
t+2 k—t 
= Z3 “ const 2, X A 


k>0 ^2 





The constant term on the right-hand side is the term with k = t, that is, 
—t t+2 
Zz t+1 
const L12 igh he ee ep, 
3i & = a E gte 


So of the three constant terms only one survives, and with const», (t+1) = t+1 
we recover what we have known since the beginning of this chapter: 


t+2,-t -t t+2 
Bi 2a 2122 21 22 J= 


(z1= 22)? (z122) (2-22)? 








H(t) = const ( 


This was a lot of work for this seemingly trivial polynomial. Recall, for 
example, that we can get the same result by an easy interpolation. How- 
ever, to compute a similar interpolation, e.g., for H4, we likely would need 
to use a computer (to obtain the interpolation values). On the other hand, 
the constant-term computation of H4 boils down to only five iterated con- 
stant terms, which can actually be computed by hand (see Exercise 6.14). 
The result is 

= H č g Mig, 19 y 2 1109 5 434 35117 3 

H(t) = zt tit t E t 





j t 
630 135 3 540 10 5670 

+ eg + E 
63 18 i 
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6.4 The Enumeration of Magic Squares 


What happens when we bring the diagonal constraints, which are not present 
in the semimagic case, into the magic picture? In the introduction of this 
chapter we have already seen an example, namely the number of 2 x 2 magic 
squares, 


1 iftis even, 
MNO) = 3 if t is odd. 


This is a very simple example of a quasipolynomial. In fact, like H,,, the 
counting function Mnp is defined by integral linear equations and inequalities, 
so it is the lattice-point enumerator of a rational polytope, and Theorem 3.23 
gives at once the following result. 











Theorem 6.7. The counting function M,,(t) is a quasipolynomial in t. 





We invite the reader to prove that the degree of Mn is n? — 2n — 1 (Exer- 


cise 6.16). 
Let’s see what happens in the first nontrivial case, 3 x 3 magic squares. 
We follow our recipe and assign variables m1, Mz, ...,mMg to the entries of our 


3 x 3 squares, as in Figure 6.6. 





mi | M2 | M3 





ma | m5 | M6 




















Fig. 6.6. Variables in a magic 3 x 3 square. 


The magic conditions require now that m1, M2,..., Mg € Z>o and 


Mı +M + mMm =t, 
m4 + m5 +m =t, 
mr + msg + mg =t, 


mı +m +m =t, 





Mz + M5 +M =t, 
m3 +me +mMg =t, 


mı +m5 +mMmg =t, 





m3 +m5 +m7=t, 


according to the row sums (the first three equations), the column sums (the 
next three equations), and the diagonal sums (the last two equations). By 
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now, we’re experienced in translating this system into a generating function: 
we need one variable for each equation, so let’s take 21, z2,z3 for the first 
three, w1, w2, w3 for the next three, and y1, y2 for the last two equations. The 
function M3(t) is the constant term of 


1 
(1 = Z1w1Yy1) (1 ae 21 We) (1 = 21 W3Y2) (1 = 221) (1 = Z2W2Y1Y2) (1 = 22W3) 


1 
: (6.12) 
(1 — z3w1y2) (1 — z3we) (1 — z3w3y1) (212223W1W2W3y1Y2) 








It does take some work, but it is instructive to compute this constant term 
(just try it!). The result is 


24242441 if 3]/t 
Mag ane ars Beale. (6.13) 
0 otherwise. 


As predicted by Theorem 6.7, M3 is a quasipolynomial. It has degree 2 and 
period 3. This may be more apparent if we rewrite it as 


22 4+2¢+1 ift=0 mod 3, 
M3(t) = < 0 ift=1 mod 3, 
0 if t = 2 mod 3, 


and we can see the three constituents of the quasipolynomial M3. There is 
an alternative way to describe M3; namely, let 


2 if t = 0 mod 3, 
co(t)=40 ift=1 mod 3, 
0 ift=2 mod 3, 
Z if t = 0 mod 3, 
a(t)= 40 ift=1 mod 3, 
0 ift=2 mod 3, 
1 ift=0 mod 3, 
co(t)= 40 ift=1 mod 3, 
0 ift=2 mod 3. 


Then the quasipolynomial M3 can be written as 


M(t) = (t) +c (t)t + a(t). 


Notes 


1. Magic squares date back to China in the first millennium B.C.E. [53]; they 
underwent much further development in the Islamic world late in the first 
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millennium C.E. and in the next millennium (or sooner; the data are lacking) 
in India [54]. From Islam they passed to Christian Europe in the later Middle 
Ages, probably initially through the Jewish community [54, Part II, pp. 290 
ff.] and later possibly Byzantium [54, Part I, p. 198], and no later than the 
early eighteenth century (the data are buried in barely tapped archives) to 
sub-Saharan Africa [191, Chapter 12]. The contents of a magic square have 
varied with time and writer; usually they have been the first n? consecutive 
integers, but often any arithmetic sequence or arbitrary positive numbers. 
In the past century, mathematicians have made some simplifications in the 
interest of obtaining results about the number of squares with a fixed magic 
sum, in particular, allowing repeated entries as in this chapter. 


2. The problem of counting magic squares (other than traditional magic 
squares) seems to have occurred to anyone only in the twentieth century, 
no doubt because there was no way to approach the question previously. The 
first nontrivial formulas addressing the counting problem, namely (6.4) and 
(6.13) for Hz and M3, were established by Percy Macmahon (1854—1929)° 
[124] in 1915. Recently there has grown up a literature on exact formulas (see 
for example [80, 168] for semimagic squares; for magic squares see [1, 23]; for 
magic squares with distinct entries see [31, 189]). 


3. Another famous kind of square is latin squares (see, for example, [72]). 
Here each row and column has n different numbers, the same n numbers in 
every row/column (usually taken to be the first n positive integers). There 
are counting problems associated with latin squares, which can be attacked 
using Ehrhart theory [31] (see also [165, Sequence A002860]). 


4. Recent work includes mathematical-historical research, such as the discov- 
ery of unpublished magic squares of Benjamin Franklin [2, 141]. Aside from 
mathematical research, magic squares and their siblings naturally continue 
to be an excellent source of topics for popular mathematics books (see, for 
example, [4] or [144]). 


5. The Birkhoff-von Neumann polytope Bn possesses fascinating combinato- 
rial properties [38, 48, 49, 58, 192] and relates to many mathematical areas 
[74, 112]. Its name honors Garrett Birkhoff and John von Neumann, who 
proved that the extremal points of Bn are the permutation matrices [39, 186] 
(see Exercise 6.5). A long-standing open problem is the determination of 
the relative volume of Bn, which is known only for n < 10 [165, Sequence 
A037302]. In fact, the last two records (n = 9 and 10) for computing vol Bn 
rely on the theory of counting functions that is introduced in this book, more 
precisely, Theorem 6.6 [28]. 


5 For more information about MacMahon, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/MacMahon.html. 
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6. An important generalization of the Birkhoff-von Neumann polytope is the 
transporation polytopes, which consist of contingency tables. They have appli- 
cations to statistics and in particular to disclosure limitation procedures [68]. 
The Birkhoff-von Neumann polytopes are special transportation polytopes 
that consist of two-way contingency tables with given 1-marginals. 


7. The polynomiality of H, (Theorem 6.2) and its symmetry (Theorem 6.3) 
were conjectured in 1966 by Harsh Anand, Vishwa Dumir, and Hansraj Gupta 
[3] and proved seven years later independently by Eugéne Ehrhart [80] and 
Richard Stanley [168]. Stanley also conjectured that the numerator coefficients 
in Corollary 6.4 are unimodal, a fact that was proved only in 2005, by Christos 
Athanasiadis [8]. The quasipolynomiality of M,, (Theorem 6.7) and its degree 
are discussed in [23]. The period of M,, is in general not known. In [23] it 
is conjectured that it is always nontrivial for n > 1. The work in [1] gives 
some credence to this conjecture by proving that the polytope of magic n x n 
squares is not integral for n > 2. 


8. We close with a story about Cornelius Agrippa’s De Occulta Philosophia, 
written in 1510. In it he describes the spiritual powers of magic squares and 
produces some squares of orders from three up to nine. His work, although 
influential in the mathematical community, enjoyed only brief success, for 
the Counter-Reformation and the witch hunts of the Inquisition began soon 
thereafter: Agrippa himself was accused of being allied with the devil. 


Exercises 


6.1. & Find and prove a formula for H,,(1). 
6.2. Let (ij) <j ;<3 be a magic 3 x 3 square. 


(a) Show that the center term £29 is the average over all 2;;. 
(b) Show that M3(t) = 0 if 3 does not divide t. 


6.3. & Prove that dim Bn = (n — 1)?. 


6.4. Prove the following characterization of a vertex of a convex polytope P: A 
point v € P is a vertex of P if for any line L through v and any neighborhood 
N of v there exists a point in LM N that is not in P. 


6.5. & Prove that the vertices of Bn are the n x n-permutation matrices. 


6.6. & Let H? (t) denote the number of n x n matrices with positive integer 
entries summing up to t along each row and column. Show that Hẹ (t) = 
H(t —n) fort >n. 


6.7. & Show that H°(1) = H°(2) = --- = H(n—1) =0. 
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6.8. & Prove (6.6): 


Ha (t) = consta... a(o ezn)” (const cos 7 — z) ) . 


6.9. & Compute the partial fraction coefficients (6.8). 
6.10. & Verify (6.11). 


6.11. Repeat the constant-term computation of Hə starting from (6.9), but 
now by first computing the constant term with respect to z2, and after that 
with respect to 21. 





6.12. Use your favorite computer program to calculate the formula for H3(t), 
H4(t),... by interpolation. 


6.13. Compute H3 using Theorem 6.6. 
6.14. Compute H, using Theorem 6.6. 


6.15. Show that 
gern 1 


2r - 


jek Zk — ži) ae 
and use this identity to give an alternative proof of Theorem 6.6. 
6.16. & Prove that for n > 3, the degree of Mn is n? — 2n — 1. 
6.17. Compute the vertices of the polytope of 3 x 3 magic squares. 
6.18. & Verify (6.12) and use it to compute M3. 
6.19. Compute M3 by interpolation. (Hint: Use Exercises 6.2 and 6.17.) 


6.20. A symmetric semimagic square is a semimagic square that is a sym- 
metric matrix. Show that the number of symmetric semimagic n x n squares 
with line sum t is a quasipolynomial in t. Determine its degree and period. 


Open Problems 


6.21. Compute the number of traditional magic n x n squares for n > 5. 
6.22. Compute vol Bn for n > 10. Compute H,, for n > 9. 
6.23. Prove that the period of Mn is nontrivial for n > 1. 


6.24. The vertices of the Birkhoff-von Neumann polytope are in one-to-one 
correspondence with the elements of the symmetric group Sn. Consider a sub- 
group of Sn and take the convex hull of the corresponding permutation matri- 
ces. Compute the Ehrhart polynomials of this polytope. (The face numbers of 
the polytope corresponding to the subgroup An, the even permutations, were 
studied in [102].) 

6.25. Prove that the graph formed by the vertices and edges of any 2-way 
transportation polytope is Hamiltonian. 
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Finite Fourier Analysis 


God created infinity, and man, unable to understand infinity, created finite sets. 


Gian-Carlo Rota (1932-1999) 


We now consider the vector space of all complex-valued periodic functions 
on the integers with period b. It turns out that every such function a(n) on 
the integers can be written as a polynomial in the bt? root of unity €” := 
e27n/> Such a representation for a(n) is called a finite Fourier series. 
Here we develop the finite Fourier theory using rational functions and their 
partial fraction decomposition. We then define the Fourier transform and the 
convolution of finite Fourier series, and show how one can use these ideas to 
prove identities on trigonometric functions, as well as find connections to the 
classical Dedekind sums. 

The more we know about roots of unity and their various sums, the deeper 
are the results that we can prove (see Exercise 7.19); in fact, certain statements 
about sums of roots of unity even imply the Riemann hypothesis! However, 
this chapter is elementary and draws connections to the sawtooth functions 
and Dedekind sums, two basic sums over roots of unity. The general philos- 
ophy here is that finite sums of rational functions of roots of unity are basic 
ingredients in many mathematical structures. 


7.1 A Motivating Example 
To ease the reader into the general theory, let’s work out the finite Fourier 
series for a simple example first, an arithmetic function with a period of 3. 
Example 7.1. Consider the following arithmetic function, of period 3: 
n:0,1,2,3,4,5,... 
a(n) :1,5,2,1,5,2,... 
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We first embed this sequence into a generating function as follows: 


F(z) = 1452422? + 2345244222 +---=S a(n) 2”. 
n>0 


Since the sequence is periodic, we can simplify F(z) using a geometric series 
argument: 


n>0 
= 1+ 5z +22? +2? (1+ 5z +22?) +26 (1 +52 +22?) +- 
= (1 +52 + 22°) X 2% 
k>0 
1452+ 227 
E 1-23 
We now use the same technique that was employed in Chapter 1, namely the 
technique of expanding a rational function into its partial fraction decompo- 
sition. Here all the poles are simple, and located at the three cube roots of 
unity, so that 
re) = 2 4 an a2) 
l-z l-pz 1-22 





(7.1) 
where the constants G(0), @(1), @(2) remain to be found, and where p := e?7*/9, 
a third root of unity. Using the geometric series for each of these terms sepa- 
rately, we arrive at 


F(z) = >> (40) + G(1)p + 4(2)p”) 2”, 


n>0 


so that we’ve derived the finite Fourier series of our sequence a(n)! The only 
remaining piece of information that we need is the computation of the con- 
stants â(j), for j = 0,1,2. It turns out that this is also quite easy to do. We 
have, from (7.1) above, the identity 


a(0) (1 — pz) (1 — p?z) + (1) (1 — z) (1— pz) + (2) (1 — z) (1 — pz) 
=14+5z2+4+ 227, 





valid for all z € C. Upon letting z = 1, p?, and p, respectively, we obtain 


3a(0) =14+542, 
34(1) = 1+ 5p? + 29%, 
34(2) =1+5p + 2p’, 
where we’ve used the identity (1 — p)(1 — p°) = 3 (see Exercise 7.2). We can 


simplify a bit to get â(0) = §, a(1) = =, and a(2) = —4#". Thus the 
finite Fourier series for our sequence is 
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8 4 n 1 2n 
a(n)= E =3 7P)? + -3 tP p”. 


The object of the next section is to show that this simple process follows just 
as easily for any periodic function on Z. The ensuing sections contain some 
applications of the finite Fourier series of periodic functions. 














7.2 Finite Fourier Series for Periodic Functions on Z 


The general theory is just as easy conceptually as the example above, and we 
now develop it. Consider any periodic sequence on Z, defined by {a(n)}°o, of 
period b. Throughout the chapter, we fix the bt? root of unity £ := e?7*/°. As 
before, we embed our periodic sequence {a(n)}°2_) into a generating function, 


n>0 


and use the periodicity of the sequence to immediately get 


b—1 b—1 b-1 
F(z) = (= a(k) “| a ps a(k) “| z? + (£ a(k) “| yb. 





k=0 k=0 k=0 
b-1 
ki a(k) z" a P(z) 
1—2? 1— 2? 
where the last step simply defines the polynomial P(z) = i a(k) z*. Now 


we expand the rational generating function F(z) into partial fractions, as 
before: 





b—1 e 
ro- 8 “S. 
m=0 


As in the example of the previous section, we expand each of the terms 
as a geometric series, and substitute into the sum above to get 


pe ee 
1-E™z 


biis ob 
F(z)= 5 a(n) z” = 5 ae 
n>0 m=0 
b—1 b—1 
= a(m) ee (x a(m) =) rab 
m=0 n>0 n>0 \m=0 


Comparing the coefficients of any fixed z” gives us the finite Fourier series for 
a(n), namely 
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We now find a formula for the Fourier coefficients G(n), as in the example. 
To recapitulate, we have 


b-1 pape l 
Pl) => alal emg => âm) [I G-é2), 
m=0 m=0 1<k<b,keém 


where we have used the factorization 1 — z2? = ma — €*z) of Exercise 7.1. 
To solve for P(€~”), we note that 





ie eG. a #0 mod b 
JP i oem if m-n mod b, 
and 
j= b b—1 
lim a a n o, 


z= gn 1— ENz as Em 


b-1 


1 — 
a(n) = FPE”) = 5 ale, 
k=0 
We have just proved the main result of finite Fourier series, using only ele- 
mentary properties of rational functions: 


Theorem 7.2 (Finite Fourier series expansion and Fourier inver- 
sion). Let a(n) be any periodic function on Z, with period b. Then we have 
the following finite Fourier series expansion: 


b-1 
a(n) =) â(k) E", 
k=0 
where the Fourier coefficients are 
yea 
a(n) = Sather, (7.2) 
k=0 





with E = e277/?, 











The coefficients @(m) are known as the Fourier coefficients of the func- 
tion a(n), and if G(m) 4 0 we sometimes say that the function has fre- 
quency m. The finite Fourier series of a periodic function provides us with 
surprising power and insight into its structure. We are able to analyze the func- 
tion using its frequencies (only finitely many), and this window into the fre- 
quency domain becomes indispensable for computations and simplifications. 
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We note that the Fourier coefficients â(n) and the original sequence ele- 
ments a(n) are related by a linear transformation given by the matrix 


T= (en) l (7.3) 


where 1 < i,j < b, as is evident from (7.2) in the proof above. We further note 
that the second half of the proof, namely solving for the Fourier coefficients 
â(n), is just tantamount to inverting this matrix L. 

One of the main building blocks of our lattice-point enumeration formulas 
in polytopes is the sawtooth function, defined by 


_ J{æ}-4 see Z, 
c= t ifx eZ. (ra 


(As a reminder, {x} = x — |x] is the fractional part of x.) The graph of this 
function is displayed in Figure 7.1. We have seen a closely related function 
before, in Chapter 1, in our study of the coin-exchange problem. Equation 
(1.8) gave us the finite Fourier series for essentially this function from the 
discrete-geometry perspective of the coin-exchange problem; however, we now 
compute the finite Fourier series for this periodic function directly, pretending 
that we do not know about its other life as a counting function. 


Ait 


Fig. 7.1. The sawtooth function y = (£). 








Lemma 7.3. The finite Fourier series for the discrete sawtooth function 
a 


((¢)), a periodic function of a € Z with period b, is given by 
b-1 . b-1 
a i 1+é* ak i TK oak 
(5)) =a Lines a i 


Here the second equality follows from eae = icot(7x), by the definition 
of the cotangent. 
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Proof. Using Theorem 7.2, we know that our periodic function has a finite 
: x a b-1 a ak 
Fourier series ((¢)) = $Y }—o @(k) €*, where 


We first compute â(0) = 4 1°) ()) = 0, by Exercise 7.14. For k # 0, we 
have 





pls a 1 b—1 1 
a —mk —mk 
= 5 > (Fs) eae Let 
m=1 m=1 
Be ea air i So Ss 
= b\L-€& 2) 2b1-—€F’ 











where we used Exercise 7.5 in the penultimate equality above. 





We define the Dedekind sum by 


TEOG] 


for any two relatively prime integers a and b > 0. Note that the Dedekind 
sum is a periodic function of the variable a, with period b, by the periodicity 
of the sawtooth function. That is, 


s(a + jb, b) = s(a, b) for all j € Z. (7.5) 


Using the finite Fourier series for the sawtooth function, we can now easily 
reformulate the Dedekind sums as a finite sum over the bt! roots of unity or 
cotangents: 


Lemma 7.4. 





1+é4 14+€- Tua 
s(a, b) -pbe -gra = p Dea oo T 


Proof. 
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We note that the last sum ee, ktua) vanishes, unless v = —ua mod b 
(Exercise 7.6), in which case the sum equals b, and we obtain 


L+ ee 1 Erne 
Ags) -pEr ene ma ` 





Rewriting the right-hand side in terms of cotangents gives 


15 =e cot = F cot = ib pe cot = — F cot HE a 


because the cotangent is an odd function. 

















7.3 The Finite Fourier Transform and Its Properties 


Given a periodic function f on Z, we have seen that f possesses a finite 
Fourier series, with the finite collection of Fourier coefficients that we called 
f0), fa ) f — 1). We now regard f as a function on the finite set 
G = {0,1,2,...,b— 1}, and let Vg be the vector space of all complex-valued 
functions on G. Equivalently, Vg is the vector space of all complex-valued, 
periodic functions on Z with period b. 

We define the Fourier transform of f, denoted by F(f), to be the peri- 
odic function on Z defined by the sequence of uniquely determined values 


f(0), fO), f(@-1). 


Thus . 
F(f)(m) = f(m). 


Theorem 7.2 above gave us these coefficients as a linear combination of the 
values f(k), with k = 0,1,2,...,b— 1. Thus F(f) is a linear transformation 
of the function f, thought of as a vector in Vg. In other words, we’ve shown 
that F(f) is a one-to-one and onto linear transformation of Vg. 

The vector space Vg is a vector space of dimension b; indeed, an explicit 
basis can easily be given for Vg using the “delta functions” (see Exercise 7.7) 
defined by 


Sin (cr) 1 ifx=m + kb, for some integer k, 
m\T) = 
0 otherwise. 


In other words, 6,,(x) is the periodic function on Z that picks out the arith- 
metic progression {m + kb: k € Z}. 

But there is another natural basis for Vg. For any fixed integer a the roots 
of unity {ea(x) := e272*/ : x € Z} can be thought of as a single function 
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e,(x) E€ Va because of its periodicity on Z. As we saw in Theorem 7.2, the 
functions {e;(),...,e,(x)} give a basis for the vector space of functions Vg. 
A natural question now arises: how are the two bases related to each other? 
An initial observation is 


64(n) = mL 


which simply follows from the computation 


—kn —an l —2mian 
ilk = ig See 


Z i 


ôa(n) = aS 


> 
Il 


0 


So to get from the first basis to the second basis, we need precisely the finite 
Fourier transform! 
It is extremely useful to define the following inner product on this vector 


space: 
b—1 

9) =X f(k) glk), (7.6) 
k=0 


for any two functions f,g € Va. Here the bar denotes complex conjugation. 
The following elementary properties show that (f,g) is an inner product (see 
Exercise 7.8): 


1. (f, f) => 0, with equality if and only if f = 0, the zero function. 
2. (f,9) = (9, f)- 
Equipped with this inner product, Vg can now be regarded as a metric 
space. We can now measure distances between any two functions, and in 


particular between any two basis elements eg(2) := €27**/? and e(z) := 
e nicio; Any positive definite inner product gives rise to the distance function 


d(f,g9)=(f-9, f- 9). 


Lemma 7.5 (Orthogonality relations). 





1 ifb|(a-c), 
0 otherwise. 


ole 


(ea, €c) = ĝa (c) = f 


Proof. We compute the inner product 





b—1 b—1 
lea, ec) =) e,(m = err il a— c)m/b. 
m=0 m=0 


If b | (a—c), then each term equals 1 in the latter sum, and hence the sum 
equals b. This verifies the first case of the lemma. 

If bt (a—c), then ea—e(m) = e?7™la-°)/b is a nontrivial root of unity, and 
we have the finite geometric series 
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b-1 2rim(a—c) 
brita ejr a =] 


=0, 


2rim(a—c) 


b — 1 


m=0 











verifying the second case of the lemma. 





Example 7.6. We recall the sawtooth function again, since it is one of the 
building blocks of lattice point enumeration, and compute its Fourier trans- 
form. Namely, we define 


k A E eZ, 
Bík):=((2))= {E-a i è ¢ Z, 
b 0 if ġ € Z, 
a periodic function on the integers with period b. What is its finite Fourier 
transform? We have already seen the answer, in the course of the proof of 
Lemma 7.3: f 
a 11+6E i TN 


= = t 
Bin) = pi ee a 


for n # 0, and B(0) = 0. As always, € = e2ri/d. 

















In the next section we delve more deeply into the behavior of this inner 
product, where the Parseval identity is proved. 


7.4 The Parseval Identity 


A nontrivial property of the inner product defined above is the following iden- 
tity, linking the “norm of a function” to the “norm of its Fourier transform.” It 
is known as the Parseval identity, and also goes by the name of the Plancherel 
theorem. 


Theorem 7.7 (Parseval identity). For all f € Vo, 


ira): 


Proof. Using the definition e,,(x) = €”* and the relation 





from Theorem 7.2, we have 
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Naa u 
=5 NOYO f(m) enk) X f(r) en(k) 
k=0 m=0 n=0 
Eo 
= 5 FO) FH) (emen) 
m=0 n=0 
ees 
=F OY) FO) dnl) 
m=0 n=0 
1 
= rate f) ’ 


where the essential step in the proof was using the orthogonality relations 
(Lemma 7.5) in the fourth equality above. 














A basically identical proof yields the following stronger result, showing that 
the “distance between any two functions” is essentially equal to the “distance 
between their Fourier transforms.” 


Theorem 7.8. For all f,g E€ Va, we have 
(f,9) = (f,9). 


Example 7.9. A nice application of the generalized Parseval identity above 
now gives us Lemma 7.4 very quickly, the reformulation of the Dedekind sum 
as a sum over roots of unity. Namely, we first fix an integer a relatively prime 
to b and define f(k) = ((£)), and g(k) = ((4¢)). Then, using Example 7.6, we 


have f(n) = + cot ™. To find the Fourier transform of g we need an extra 
2b b g 


twist. Since 
(F)) -a bape 


we can multiply each index m by a71, the multiplicative inverse of a modulo 
b (recall that we require a and b to be relatively prime for the reformulation 
of the Dedekind sum). Since a! is relatively prime to b, this multiplication 
just permutes m = 1,2,...,b—1 modulo b, but the sum stays invariant (see 
Exercise 1.9): 














b-1 b-1 


=I -1 
mka __ TAMA ma` tka it rma mk 
2 cot 22 —é = > cot —; 3 = > cot 5 Er 


m=1 m=l1 





that is, 
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Hence Theorem 7.8 immediately gives us the reformulation of the Dedekind 


woo BODE) 














m=1 
b-1 
1 m amma! 
= t — cot 
4b 2 PD b 
b-1 
= = cot = ot aie 
~ Ab = b b 











For the last equality, we again used the trick of replacing m by ma. 





7.5 The Convolution of Finite Fourier Series 


Another basic tool in finite Fourier analysis is the convolution of two finite 
Fourier series. Namely, let f(t) = 7 ee ax €* and g(t) = aa eee 
where £ = e?™t/b. We define the convolution of f and g by 


b-1 


(f*a)(t)= >, FE- mgm). 


0 


| 


3 
l 


Indeed, it is this convolution tool (the proof of the convolution theorem be- 
low is almost trivial!) that is responsible for the fastest known algorithm for 
multiplying two polynomials of degree b in O(blog(b)) steps (see the Notes at 
the end of this chapter). 


Theorem 7.10 (Convolution theorem for finite Fourier series). Let 
ft =t a ap ¿"t and g(t) = ¢ ge cr &**, where € = e?/. Then their 
convolution satisfies 


1 b—1 
(Peg) == X acp. 
k=0 


Proof. The proof is straightforward: we just compute the left-hand side, and 
obtain 
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b-1 1 b—1 b—1 b—1 
m=0 m=0 \k=0 l=0 
1 b—1 b—1 — 
=z AKC] s pare) 
k=0 1=0 m=0 
b-1 
1 kt 
= 7) akk”, 
b k=0 


because the sum wees ie yanib, unless l = k (see Exercise 7.6). In 


the case that | = k, we have Yt 4 €¢-*)™ = b, 














It is an easy exercise (Exercise A to show that this convolution theorem 
is equivalent to the following statement: 


F(f *g) =bF(f)F(g). 


Note that the proof of Theorem 7.10 is essentially identical with the proof of 
Lemma 7.4 above; we could have proved the lemma, in fact, by applying the 
convolution theorem. We now show how Theorem 7.10 can be used to derive 
identities on trigonometric functions. 


Example 7.11. We claim that 


So (**) - (b— Dee 2) 


The sum suggests the use of the convolution theorem, with a function whose 
Fourier coefficients are a, = cpr = cot 7E zk . But we already know such a function! 
It is just the sawtooth function 26 (2 a), Therefore 


A eaa 


where the equality follows from Theorem 7.10. On setting t = 0, we obtain 


O 











1 b—1 cS 1 

ee Se 2 j 

5 po poe AU 1) 
(b—1)(b—2) 

E 12b , 


as desired. We used the identity (=) = (=) in the first equality above, 
and some algebra was used in the last equality. Notice, moreover, that the 
convolution theorem gave us more than we asked for, namely an identity for 
every value of t. 
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Notes 


1. Finite Fourier analysis offers a wealth of applications and is, for example, 
one of the main tools in quantum information theory. For the reader inter- 
ested in going further than the humble beginnings outlined in this chapter, 
we heartily recommend Audrey Terras’s monograph [180]. 


2. The Dedekind sum is our main motivation for studying finite Fourier series, 
and in fact, Chapter 8 is devoted to a detailed investigation of these sums, in 
which the Fourier—-Dedekind sums of Chapter 1 also finally reappear. 


3. The reader may consult [116, p. 501] for a proof that two polynomials of 
degree N can be multiplied in O(N log N) steps. The actual proof of this fact 
runs along the following conceptual lines. First, let the two given polynomials 
of degree N be f(x) = eae a(n)x” and g(x) = ae b(n)a”. Then we know 
that f(€) and g(&) are two finite Fourier series, and we abbreviate them by 
f and g, respectively. We now note that fg = F(F~'(f) * F~+(g)). If we 
can compute the finite Fourier transform (and its inverse) quickly, then this 
argument shows that we can multiply polynomials quickly. It is a fact of life 
that we can compute the Fourier transform of a periodic function of period 
N in O(N log N) steps, by an algorithm known as the fast Fourier transform 
(again, see [116] for a complete description). 


4. The continuous Fourier transform, defined by f°. f(t)e~?""*dt, can be 
related to the finite Fourier transform in the following way. We approximate 
the continuous integral by discretizing a large interval [0,a]. Precisely, we let 
A := §, and we let t := kA = sa, Then 


a b 
I fe 2? di PS 5 f(tp)e Fer í 
0 k=1 


a finite Fourier series for the function f(¢2) as a function of x € Z. Hence 
finite Fourier series find an application to continuous Fourier analysis as an 
approximation tool. 


Exercises 
Throughout the exercises, we fix an integer b > 1 and let € = e?7*/°. 
7.1. & Show that 1— x? = JJ? (1 — & x). 


7.2. & Show that JJZ} (1 — £) = b. 
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7.3. Consider the matrix that came up in the proof of Theorem 7.2, namely 
L = (aij), with aj; := €°-YGO- and with 1 < i,j < b. Show that the matrix 
ae is a unitary matrix (recall that a matrix U is unitary if U*U = I, where 


U* is the conjugate transpose of U). Thus, this exercise shows that the Fourier 
transform of a periodic function is always given by a unitary transformation. 


7.4. Show that det (41) | = 1, where |z| denotes the norm of the complex 


number z. (It turns out the det(L) can sometimes be a complex number, but 
we will not use this fact here.) 


7.5. & For any integer a relatively prime to b, show that 


b—1 
1 (S 
= X hea = ' 
b E 1— ¿e 


7.6. & Let n be an integer. Show that the sum Da E” vanishes, unless 
n = 0 (mod b), in which case it is equal to b. 





7.7. @& For an integer m, define the delta function 6,,(x) by 


Ho 1 if x= m +ab, for some integer a, 
qt) = 
K 0 otherwise. 


The b functions ô (x), ..., (x) are clearly in the vector space Vg, since they 
are periodic on Z with period b. Show that they form a basis for Vg. 


7.8. & Prove that for all f,g € Va: 
(a) (f, f) > 0, with equality if and only if f = 0, the zero function. 
(b) (f,9) = (9, f)- 


7.9. Show that 4-1 par = 454. 








7.10. Show that (ôa, 6c) = ĝa (C). 
7.11. Prove that (f x 6a)(x) = f(x — a). 


7.12. Prove that ĝa * dc = da+e(mod b): 


Qriax 





7.13. Prove that f(t —a) = fla)je 
7.14. & For any real number x, prove that (x) = Sa (==). 


7.15. If x is not an integer, show that DE cot (r242) = bcot (rz). 
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7.16. For any integer a relatively prime to b, show that 
atl _ 1 
Lenea’ 
TENE 


where the sum is taken over all b! roots of unity € except € = 1. 
7.17. We call a root of unity e?7'*/ a primitive b root of unity if a is 
relatively prime to b. Let p(x) denote the polynomial with leading coefficient 
1 and of degree ¢(b)! whose roots are the ¢(b) distinct primitive bt” roots of 
unity. This polynomial is known as the cyclotomic polynomial of order b. Show 


that 
[[ 2a) = a = 1, 
d|b 
where the product is taken over all positive divisors d of b. 
7.18. We define the Mobius u-function for positive integers n by 
ifn= 1, 


1 
p(n) = 40 if n is divisible by a square, 
(—1)* ifn is square-free and has k prime divisors. 


Deduce from the previous exercise that 


,(2) = Il (a = pele 


d|b 


7.19. Prove that for any positive integer b, 
5 e2ria/b = u(b) i 
1<a<b,(a,b)=1 
the Mobius p-function. 
k 


7.20. Show that for any positive integer k, s(1,k) = —4 + a t 


7.21. Show that Ņ}—] tan? (ZE) = b(b — 1). 
7.22. & Show that Theorem 7.10 is equivalent to the following statement: 
F(f * g) = bF(f)F(g). 


7.23. Consider the trace of the linear transformation L = (0700-9), de 
fined in (7.3). The trace of L is G(b) := aur é™ known as a Gauß sum. 
Show that |G(b)| = vb if b is an odd prime. 


1 6(b) := # {k € [1,b — 1] : (k,b) = 1} is the Euler ¢-function. 
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Dedekind Sums, the Building Blocks 
of Lattice-point Enumeration 


If things are nice there is probably a good reason why they are nice: and if you don’t 
know at least one reason for this good fortune, then you still have work to do. 


Richard Askey 


We’ve encountered Dedekind sums in our study of finite Fourier analysis and 
we became intimately acquainted with their siblings in our study of the coin- 
exchange problem in Chapter 1. They have one shortcoming, however (which 
we'll remove): the definition of s(a,b) requires us to sum over b terms, which 
is rather slow when b = 219°, for example. Luckily, there is a magical reci- 
procity law for the Dedekind sum s(a,b) that allows us to compute it in 
roughly log,(b) = 100 steps. This is the kind of magic that saves the day 
when we try to enumerate lattice points in integral polytopes of dimensions 
d < 4. There is an ongoing effort to extend these ideas to higher dimensions, 
but there is much room for improvement. In this chapter we focus on the 
computational-complexity issues that arise when we try to compute Dedekind 
sums explicitely. 


8.1 Fourier—Dedekind Sums and the Coin-Exchange 
Problem Revisited 


Recall from Chapter 1 the Fourier-Dedekind sum (defined in (1.13)) 


ef 


1 b—1 
Sn (a1, 02, .. . , aq; b) = ’ 
EPO Tw) 


~ Sb 





which appeared as a main player in our analysis of the Frobenius coin- 
exchange problem. We can now recognize the Fourier-Dedekind sums as hon- 
est finite Fourier series with period b. The Fourier-Dedekind sums unify many 
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variations of the Dedekind sum that have appeared in the literature, and form 
the building blocks of Ehrhart quasipolynomials. For example, we showed in 
Chapter 1 that sn (a1, @2,...,@a;b) appears in the Ehrhart quasipolynomial 
of the d-simplex 


{ (a1,-+-;2a41) € Ro 214%, +: + adata + bta = Ta 


Example 8.1. We first notice that when n = 0 and d = 2, the Fourier- 
Dedekind sum reduces to a classical Dedekind sum (which—finally—explains 
the name): for relatively prime positive integers a and b, 


b—1 


so(a, 1; b) = 





ole 


= Ej (1-g) 


= 
“ikha (aa 


TT 
ell 














ae 1 De ies 

"mrg wA E bi~4 
i I (28) +3 1  b-1l 
4b + |1- eke bi- ab 


In the last step we used the fact that multiplying the index k by a does not 
change the middle sum. This middle sum can be further simplified by recalling 
(1.8): 














b—1 
De, mn T (j+ n 
whence 
vand- 5° (HE) (8) | z a 
-g o (=) cot (7E) } i (8.1) 
SEa re. 











Example 8.2. The next special evaluation of a Fourier-Dedekind sum is very 
similar to the computation above, so that we leave it to the reader to prove 
(Exercise 8.5) that for a1, a2 relatively prime to b, 

b-1 


SO (a1, a2; b) =—S (aiaz 1b) + o ; (8.2) 











where a3 ‘az =1 mod b. 
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Returning to the general Fourier-Dedekind sum, we now prove the first 
of a series of reciprocity laws: identities for certain sums of Fourier—Dedekind 
sums. We first recall how these sums came up in Chapter 1, namely, from the 
partial fraction expansion of the function 











1 
OS ten (ee 
Ai A An Bı B2 Ba 
=—4 bes 4 pos 8.3 
z 2 m eat (z— 1)? Tem 1) (8:3) 
aı—1 a2— Qa— 
SSeS Oe tt Fe 
T __ Ek T __ ¢k Ts =< ah Ek . 
oE i AE E 
(Here we assume that a1,a2,...,Qq are pairwise relatively prime.) Theo- 


rem 1.7 states that with the help of the partial fraction coefficients B,,..., Bg 
and Fourier—Dedekind sums, we can compute the restricted partition function 
for A = {a1,a2,...,@a}: 








pa(n) = —B, + By —---+(—1)4Bg + s-n (a2, a3,..., ag; 01) 
+ Sin (a1, 03,04, .. -adj a2) +: + Sn (a1, @2,..-,@d—13; đa). 
We note that B1, B2,..., Bq are polynomials in n (Exercise 8.6), whence we 
call 
poly 4(n) := —B, + By —--+ + (-1)"Ba 


the polynomial part of the restricted partition function p(n). 


Example 8.3. The first few expressions for polysq, aq} (7) are 


1 
Plyta p(n) = —, 








ay 
n 1/1 1 
1 = i | , 
po Y {01,02} (7) aj,ag 2 e a) 
n2 n 1 1 1 
_ l | 8.4 
POLY fa, 09,03} (7) 2a1aza3 2 (— a,a3 =a) (8.4) 





1 3 3 3 ay a2 a3 
+ +—+—4 + | ; 
12 ay, ag a3 a243 a143 a142 
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n3 


POlY {a1 ,a2,as,aa} (2) > 601020304 





n? 1 1 1 1 
a T 
4 \a@aia20a3 1204 940304 20304 





n 1 l 1 1 1 1 1 
4 \ aiaz ! a103 Q14 Q203 Q204 34 








n ay 4 a2 i a3 a4 
ai a T 
12 \aga3a4q a1a3a4 Q1đa2đa4 Q1đ2403 


1 ( at at ay ag a2 a2 








a | | | fi fi 
T T T T 
24 \a2a3 a2đa4 Q304 Q103 a1d4 a3a4 


a3 
} t + } } t 
a a2 a a4 a2a4 a ag a a3 Q243 


1 ( 1 1 1 1 ) 
+ + H + : 
8 lai a2 ag a4 





a3 a3 a4 a4 as ) 

















We are about to combine the Ehrhart results of Chapter 3 with the partial 
fraction expansion of Chapter 1 that gave rise to the Fourier-Dedekind sums. 


Theorem 8.4 (Zagier reciprocity). For any pairwise relatively prime pos- 
itive integers a1, a2, ..., ad, 


So (a2, 43, . . . , Qd; 41) + So (a1, 43, Q4,- . , Qaj a2) +++: 


+ so (a1, @2,..., @d—13 Ga) 


=1- POLY tan ,a2,... aa} (0) g 


At first sight, this reciprocity law should come as a surprise. The Fourier- 
Dedekind sums can be complicated, long sums, yet when combined in this 
fashion, they add up to a trivial rational function in a1,đ2,..., aq. 


Proof. We compute the constant term of the quasipolynomial p4 (n): 
pa(0) = poly 4(0) + so (a2, a3, . . . , aa; a1) 
+ So (a1, a3, G4, . wee , Ad; a2) +:++++ 89 (a1, a2, ti .;Ad—1; Ad) - 


On the other hand, Exercise 3.27 (the extension of Corollary 3.15 to Ehrhart 
quasipolynomials) states that p4(0) = 1, whence 


1 = poly 4 (0) + so (a2, a3,...,@a} a1) 


+ so (@1, 43, Q4, . . - , Qd; a2) +--+ + So (Q1, G2,...,@d—13 Ga) - 
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8.2 The Dedekind Sum and Its Reciprocity 
and Computational Complexity 


We derived in (8.1) the classical Dedekind sum s(a, }) as a special evaluation 
of the Fourier—Dedekind sum. Naturally, Theorem 8.4 takes on a particular 
form when we specialize this reciprocity law to the classical Dedekind sum. 


Corollary 8.5 (Dedekind’s reciprocity law). For any relatively prime 
positive integers a and b, 


Ce BN 
sab talie) =" (5 | 7+) 4° 


Proof. A special case of Theorem 8.4 is 





so(a, 1; b) T so(b, a; 1) + so(1, b; a) =1- POLY {a,1,b} (0) 


1/3 3 1 b 
=1 ( FI Se ) 
a a 





12 b b ab 
3 1 a b 1 Ko i 
~4 12\b a ab 4a 4b` 


Now we use the fact that so(b,a; 1) = 0 and the identity (8.1): 





1 
Ab ` 

















1 
sola, 1;b) = —s(a,b) + 3 


Dedekind’s reciprocity law allows us to compute the Dedekind sum s(a, b) 
as quickly as the gcd algorithm for a and b. Let’s get a better feeling for 
the way we can compute the Dedekind sum by working out an example. We 
remind the reader of another crucial property of the Dedekind sums that we 
already pointed out in (7.5): s(a,b) remains invariant when we replace a by 
its residue modulo b, that is, 


s(a, b) = s(a mod b,b). (8.5) 


Example 8.6. Let a = 100 and b = 147. Now we alternately use Corollary 8.5 
and the reduction identity (8.5): 
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1/100 147 1 1 
100, 147) = 147,1 
T (= 100 m5) 5 ea) 
1249 
1249 1/47 100 1 1 
2 100,4 
17640 (5 (= 47 +a) gene 7) 
773 
S E, 
zo727 * (6.47) 
7733 1/6 47 1 1 
= m 4 
20727 ` 12 ($ a sss 17 SGS) 
166 
“m4m s(5,6) 
166 /1/5 6 1\ 1 
~ 441 (5 G | e+) 4 s(6:5)) 
2003 
Sore ee hea 
Zig + 415) 
_ 2003 1,1 5 
~ 4410 4 30 12 
_ 57 
~ 882° 
In the last step we used Exercise 7.20: s(1,k) = —} + a + 5. A priori, 


s(100, 147) takes 147 steps to compute, whereas we were able to compute it 
in nine steps using Dedekind’s reciprocity law and (8.5). 














As a second corollary to Theorem 8.4, we mention the following three- 
term reciprocity law for the special Fourier-Dedekind sum s9(a, b; c). This 
reciprocity law could be restated in terms of the classical Dedekind sum via 
the identity (8.2). 


Corollary 8.7. For pairwise relatively prime positive integers a, b, and c, 





1/3 3 3 b 
sola, b; c) + so(c, a; b) + so(b,c;a) = 1 +-+-+ Å: +— + =y, 
12 \a b c be ca ab 














8.3 Rademacher Reciprocity for the Fourier-Dedekind 
Sum 


The next reciprocity law will be again for the general Fourier-Dedekind sums. 
It extends Theorem 8.4 beyond n = 0. 


Theorem 8.8 (Rademacher reciprocity). Let a1,a2,...,aq be pairwise 
relatively prime positive integers. Then for n = 1,2,..., (a1 +: +aa-— 1), 
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Sn (@2,43,..-,@q3 41) + Sn (1, 43,04,-.-,@q}@2) + °°: 


+ Sn (az, a2,-++,Ad-13 ad) Tee POY {az az aa} (77) 3 


str 


Proof. We recall the definition 
pln) = #{(m,...,ma) EZ": all mj >0, mya, +--+ + Maaa =n} 


of Exercise 1.31, that is, p} (n) counts the number of partitions of n using 
only the elements of A as parts, where each part is used at least once. This 
counting function is, naturally, connected to pa through Ehrhart—Macdonald 
reciprocity (Theorem 4.1): 


that is, 


(—1)*"*p4(n) = poly 4(—n) + sn (a2, a3, . - . , aq; a1) 
+ Sn (G1, 43, 4,---,@q}@2) ++++ + Sn (01,02, . . - ,ad—1; Aa) - 


On the other hand, by its very definition, 
p(n) =0 for n=1,2,...,(a1 +- +aa-— 1), 
so that for those n, 


0 = poly 4(—n) + sn (a2, a3, ... , Qa; a1) 


+ Sn (a1, 03,04, . . . , Qd; G2) + +++ + Sn (a1, G2,...,@d—13 Qa). 














Just as Zagier reciprocity takes on a special form for the classical Dedekind 
sum, Rademacher reciprocity specializes for d = 2 to a reciprocity identity for 
the Dedekind-Rademacher sum 


ECO 


The classical Dedekind sum is, naturally, the specialization ro(a,b) = s(a, b). 
To be able to state the reciprocity law for the Dedekind-Rademacher sums, 
we define the function 
1 ifaln, 
Xaln) = , 


0 otherwise, 


which will come in handy as a book keeping device. 


Corollary 8.9 (Reciprocity law for Dedekind—Rademacher sums). 
Let a and b be relatively prime positive integers. Then for n = 1,2,...,a +b, 
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n? nfl 1 1 1 fab 1 
b b = H H } 
BC a a (= a 3 T2 (F ria =) 


(2) +()+©-+@) 


Je A (1+ xa(n) + xl(n)), 











where a~ta = 1 mod b and b-!b=1 moda. 


This identity follows almost instantly once we are able to express the 
Dedekind—Rademacher sum in terms of Fourier—-Dedekind sums. 


Lemma 8.10. Suppose a and b are relatively prime positive integers and n € 
Z. Then 


-1 


TMD) Ss Bhs) 5 (5) r 5 (= )) 7 a pel), 


where a~'a = 1 mod b. 





Proof. We start by rewriting the finite Fourier series (1.8) for the sawtooth 
function ((x)): 





DEE E rh 2b 





lI 
AEL 
T 
N 
Nia 

= 

~~" 
iw) 
Sle 





-() n ) a 


Hence we also have 


In 


1% ghn 1 eka 
oe gk 





b 


a~'n 1 Li 1 
=(F*)) +a -a 
eo cee 1 (n) 1 

=A OREO, abe 
Now we use the convolution theorem for finite Fourier series (Theorem 7.10) 
for the functions 

















ae 1 b-1 kn 
as ‘= b 
re z an and g(n) := 5 >, 1 — ee 


b 


It gives 
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oa y= È An matin) = 

pel T 

A eaa N a a) 
m=0 


We invite the reader to check (Exercise 8.9) that the sum on the right-hand 
side simplifies to 


a OAC) AO) de} 


m 








whence 


Cee eee ee ; (E) +5 (2) a + Tln). 


Proof of Corollary 8.9. We use the special case of Theorem 8.8 

















Sn (a, 1; b) ga SU, a; b) ae Sn (a, b; 1) ED Poly {a 1,8} (77) 


= n n E ta 1 Fe Pig a Oy al 

~ 2b 2\ab a’ b) 12\a" b b a ab)’ 
which holds for n = 1,2,...,a +b. Lemma 8.10 allows us to translate this 
identity into one for Dedekind-Rademacher sums: 


Palae) Era a) -5 (= ü | ğ 5) f 5 G i + =) 
ENCORE HONO) 


+ 5 (1+ a(n) + x(n). 



































The two-term reciprocity law allows us to compute the Dedekind- 
Rademacher sum as quickly as the gcd algorithm, just as obtained for the clas- 
sical Dedekind sum. This fact has an interesting consequence: In Theorem 2.10 
and Exercise 2.34 we showed implicitly (see Exercise 8.10) that Dedekind— 
Rademacher sums are the only nontrivial ingredients of the Ehrhart quasi- 
polynomials of rational polygons. Corollary 8.9 ensures that these Ehrhart 
quasipolynomials can be computed almost instantly. 


8.4 The Mordell—Pommersheim Tetrahedron 


In this section we return to Ehrhart polynomials and illustrate how Dedekind 
sums appear naturally in generating-function computations. We will study 
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the tetrahedron that historically first gave rise to the connection of Dedekind 
sums and lattice-point enumeration in polytopes. It is given by 


Pelen ei 2,y,220, FE <1}, (8.6) 


a tetrahedron with vertices (0, 0, 0), (a, 0,0), (0, b, 0), and (0,0, c), where a, b,c 
are positive integers. We insert the slack variable n and interpret 


k l 
Lp(t) = # Slm) EZ: kl,m>0, — +54 le i} 
a c 
= # {(k,l m,n) EZ: k,l ,m,n > 0, bck + acl +abm +n = abct} 
as the Taylor coefficient of 22° for the function 


X zbck X gacl X abm X z” 


k>0 1>0 m>0 n>0 
1 
(2) 0 — 9) 0) (2) 





As we have done numerous times before, we shift this coefficient to the con- 
stant term: 





1 
Lp(t) = const (z z2) (1 — 24°) (1 — 2%) (1 — 2) ==) i 


To reduce the number of poles, it is convenient to change this function slightly; 
the constant term of 1/ (1 — 2°) (1 — 2°°) (1 — 2%) (1 — z) is 1, so that 





gabet Soi 
rtd) = const (a a AA) +h 


This trick becomes useful in the next step, namely expanding the function into 
partial fractions. Strictly speaking, we cannot do that, since the numerator 
is not a polynomial in z. However, we can think of this rational function as 
a sum of two functions. The higher-order poles of both summands that we 
will not include in our computation below cancel each other, so we can ignore 
them at this stage. The only poles of 


gy abet =i 
(1 — ze) (1 — z2¢) (1 — 2°) (1 — 2) 





(8.7) 
are at the att, bt, ct roots of unity and at 0. (As before, we don’t have 
to bother with the coefficients of z = 0 of the partial fraction expansion.) To 
make life momentarily easier (the general case is the subject of Exercise 8.12), 
let’s assume that a, b, and c are pairwise relatively prime; then all the poles 
besides 0 and 1 are simple. The computation of the coefficients for z = 1 is 
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very similar to what we did with the restricted partition function in Chapter 1. 
The coefficient in the partial fraction expansion of a nontrivial root of unity, 
say €*, is also computed practically as easily as in earlier examples: it is 


t 
a (I= eE) 


(see Exercise 8.11). Summing this fraction over k = 1,2,...,a — 1 gives rise 
to the Fourier-Dedekind sum 





(8.8) 





t a-—1 1 
a 2 (1 — ékbe) (1 — £X) = —t so (bc, 1; a). 
k=1 a a 


Putting this coefficient and its siblings for the other roots of unity into the par- 
tial fraction expansion and computing the constant term yields (Exercise 8.11) 


abe 43 l ab+actbeti » 


6 4 
a+b+c 1 ae ee Pa E an AO ; 
4 "Ala b'c 12\a bc abc 


a (so (be, 1; a) + so (ca, 1; b) + 80 (ab, 1; c))t 
+1. 





Lp(t) = 











We recognize instantly that the Fourier-Dedekind sums in this Ehrhart poly- 
nomial are in fact classical Dedekind sums by (8.1), and so we arrive at the 
following celebrated result. 


Theorem 8.11. Let P be given by (8.6) with a, b, and c pairwise relatively 
prime. Then 








ip) = 8 Mentet | G ani 
1 /bc ca ab 1 

H b — b) — , 

rae bc =) s (be, a) — s (ca, b) s(absc)) t+ 














We finish this chapter by giving the Ehrhart series of the Mordell— 
Pommersheim tetrahedron P. It follows simply from the transformation for- 
mulas (computing the Ehrhart numerator coefficients from the Ehrhart poly- 
nomial coefficients) of Corollary 3.16 and Exercise 3.10, and hence the Ehrhart 
series of P naturally contains Dedekind sums. 


Corollary 8.12. Let P be given by (8.6) with a, b, and c pairwise relatively 
prime. Then 
hg zethgz?thyztl 
Eh = 
rye) G- 24 
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where 





hs =] 4 z 


— s (bc, a) — s (ca, b) — s (ab, c) 
p 20c at+b+c, 3,1 be ca ab 1 
LS OF APG 

— 2 (s (bc, a) + s (ca, b) + s (ab, c)) 

b b b b 1 /b 1 
fn ea, +c gi (=-242, ) 





abc ab+ac+bc+a+b+c 1 5(~+¢ ab =) 





a b c abc 





6 4 12 
— s (bc,a) — s (ca, b) — s (ab, c). 














It is a curious fact that the above expressions for hı, h2, and hg are non- 
negative integers due to Corollary 3.11. 


Notes 


1. The classical Dedekind sums came to life in the 1880s when Richard 
Dedekind (1831-1916)! studied the transformation properties of the Dedekind 


n-function [70] 
n(z) i emz/12 Il (1 = a) 
n>1 


a useful computational gadget in the land of modular forms in number theory. 
Dedekind’s reciprocity law (Corollary 8.5) follows from one of the functional 
transformation identities for 7. Dedekind also proved that 


12k s(h,k) =k+1—2 (7) (mod 8), 


establishing a beautiful connection between the Dedekind sum and the Jacobi 
symbol (2) (the reader may want to consult the lovely Carus monograph 
entitled Dedekind Sums, by Emil Grosswald and Hans Rademacher, where the 
above result is proved [151, p. 34]), and then used this identity to show that 
the reciprocity law for the Dedekind sums (for which [151] contains several 
different proofs) is equivalent to the reciprocity law for the Jacobi symbol. 


2. The Dedekind sums and their generalizations appear in various contexts be- 
sides analytic number theory and discrete geometry. Other mathematical areas 
in which Dedekind sums show up include topology [101, 131, 190], algebraic 
number theory [129, 166], and algebraic geometry [85]. They also have con- 
nections to algorithmic complexity [114] and continued fractions [10, 99, 138]. 


1 For more information about Dedekind, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Dedekind.html. 
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3. The reciprocity laws (Theorems 8.4 and 8.8) for the Fourier-Dedekind 
sums were proved in [25]. Theorem 8.4 is equivalent to the reciprocity law for 
Don Zagier’s higher-dimensional Dedekind sums [190]. Corollary 8.7 (stated 
in terms of the classical Dedekind sum) is originally due to Hans Rademacher 
[149]. Theorem 8.8 generalizes reciprocity laws by Rademacher [150] (essen- 
tially Corollary 8.9) and Ira Gessel [87]. 


4. The Fourier-Dedekind sums form only one set of generalizations of the 
classical Dedekind sums. A long, but by no means complete, list of other 
generalizations is [5, 6, 21, 34, 35, 36, 56, 77, 78, 87, 92, 93, 113, 129, 131, 
130, 150, 179, 190]. 


5. The connection of Dedekind sums and lattice-point problems, namely The- 
orem 8.11 for t = 1, was first established by Louis Mordell in 1951 [136]. Some 
42 years later, James Pommersheim established a proof of Theorem 8.11 as 
part of a much more general machinery [146]. In fact, Pommersheim’s work 
implies that the classical Dedekind sum is the only nontrivial ingredient one 
needs for Ehrhart polynomials in dimensions three and four. 


6. We touched the question of efficient computability of Ehrhart (quasi-)po- 
lynomials in this chapter. Unfortunately, our current state of knowledge on 
generalized Dedekind sums does not suffice to make any general statement. 
However, Alexander Barvinok proved in 1994 [15] that in fixed dimension, 
the rational generating function of the Ehrhart quasipolynomial of a ratio- 
nal polytope can be efficiently computed. Barvinok’s proof did not employ 
Dedekind sums but rather used a decomposition theorem of Brion, which is 
the subject of Chapter 9. 


Exercises 


8.1. Show that s(a,b) = 0 if and only if a? = —1 mod b. 


8.2. Prove that 6b s(a, b) € Z. (Hint: Start with rewriting the definition of the 
Dedekind sum as 6b s(a,b) = 84 5%] k? — 6 NRT k |5] -3 522] k.) 


8.3. Let a and b be any two relatively prime positive integers. Show that the 
reciprocity law for the Dedekind sums implies that for b = r mod a, 


12ab s(a, b) = —12abs(r,a) + a? + b? — 3ab+1. 


Deduce the following identities: 


(a) For b= 1 mod a, 





12ab s(a,b) = —a°b +b? +a? — 2b +1. 
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(b) For b = 2 mod a, 
12ab s(a, b) = — 50 +a? +0? —- 5 +1. 
(c) For b = —1 mod a, 
12ab s(a, b) = a°b +a? + b° — 6ab + 2b +1. 
8.4. Denote by fn the sequence of Fibonacci numbers, defined by 


fi=fe=1 and fn+2 = fn41 + fn forn>1. 


Prove that 
S (fok, for+1) =0 


and 
12 fox—1 for $ (fok-1, fok) = fan—1 + fon — 3fok-1 fər +1. 


8.5. & Prove (8.2): 


b-1 
80 (a1, 42; b) = —s (ayaz*,b) + rae 


where az ‘a2 = 1 mod b. 


8.6. Prove that B,, Bo,..., Ba in the partial fraction expansion (8.3), 

















1 
f(2) = (1—2%)---(1— 24a) gn 
A, A An Bı | bə Ba 
a e a E eee. ten Ne 

aı—1 a2—1 aa—l1 

Cik Cox Cak 
+ Hed l 
> a= Ek pliers are k 
are polynomials in n (of degree less than d) and rational functions in a1,..., aa. 


8.7. & Verify the first few expressions for poly,,, aa} (7) in (8.4). 


8.8. Show that the Dedekind—Rademacher sum satisfies r_,(a,b) = rn (a,b). 


aaa 


8.9. & Show that 


E (752) e-m- a) (G2) +30) 
5 (E=) ane N a 














m=0 


+ > x(n). 


Open Problems 153 


8.10. Rephrase the Ehrhart quasipolynomials for rational triangles given in 
Theorem 2.10 and Exercise 2.34 in terms of Dedekind—Rademacher sums. 


8.11. & Prove Theorem 8.11 by verifying (8.8) and computing the coefficients 
for z = 1 in the partial fraction expansion of (8.7). 


8.12. Generalize the Ehrhart polynomial of the Mordell-Pommersheim tetra- 
hedron to the case that a, b, and c are not necessarily pairwise relatively 
prime. 


8.13. Compute the Ehrhart polynomial of the 4-simplex 


4 Tı T2 T3 T4 
{ (1, 22,23, 24) Rso : a H b H 7 H d < i 





where a,b,c,d are pairwise relatively prime positive integers. (Hint: You may 
use Corollary 5.5 to compute the linear term.) 


Open Problems 


8.14. Find new relations between various Dedekind sums. 


8.15. It is known [21] that the Fourier-Dedekind sums are efficiently com- 
putable. Find a fast algorithm that can be implemented in practice. 


8.16. For any fixed integers b and k, find a nice characterization for the set 
of all a € Z such that s(a, b) = k. 
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The Decomposition of a Polytope 
into Its Cones 


Mathematics compares the most diverse phenomena and discovers the secret analo- 
gies that unite them. 


Jean Baptiste Joseph Fourier (1768-1830) 


In this chapter, we return to integer-point transforms of rational cones and 
polytopes and connect them in a magical way that was first discovered by 
Michel Brion. The power of Brion’s theorem has been applied to various do- 
mains, such as Barvinok’s algorithm in integer linear programming, and to 
higher-dimensional Euler—Maclaurin summation formulas, which we study in 
Chapter 10. In a sense, Brion’s theorem is the natural extension of the familiar 


: : a: j b b+1_ za : : 
finite geometric series identity J m-a 2” = *4* to higher dimensions. 





9.1 The Identity “X mez 2 = 0”... 
...or “Much Ado About Nothing” 


We start gently by illustrating Brion’s theorem in dimension one. To this end, 
let’s consider the line segment Z := [20,34]. We recall that its integer-point 
transform lists the lattice points in Z in the form of monomials: 


or(z) = J gm = pl HH 284 
mETAZ 


Already in this simple example, we are too lazy to list all integers in Z and 
use --- to write the polynomial øz. Is there a more compact way to write 
oz? The reader might have guessed it even before we asked the question: this 
integer-point transform equals the rational function 


20 _ 35 


mks l-z 
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This last sentence is not quite correct: the definition of oz(z) yielded a poly- 
nomial in z, whereas the rational function above is not defined at z = 1. 
We can overcome this deficiency by noticing that the limit of this rational 
function as z — 1 equals the evaluation of the polynomials oz(1) = 15, by 
L’Ho6pital’s rule. Notice that the rational-function representation of oz has 
the unquestionable advantage of being much more compact than the original 
polynomial representation. The reader who is not convinced of this advantage 
should replace the right vertex 34 of Z by 3400. 

Now let’s rewrite the rational form of the integer-point transform of Z 
slightly: 


720 _ 435 720 234 


oz(z) = = Migs a (9.1) 





There is a natural geometric interpretation of the two summands on the right- 
hand side. The first term represents the integer-point transform of the interval 
[20, 00): 





020,00) (2) T 5 z” = l-z j 


m>20 


The second term in (9.1) corresponds to the integer-point transform of the 
interval (—oo, 34]: 





734 
F (00, 34] (2) = ` z™ = = 
m<34 z 
So (9.1) says that on a rational-function level, 
020,00) (2) + O(—00,34] (2) = F [20,34] (2) - (9.2) 


This identity, which we illustrate graphically in Figure 9.1, should come 














° e > 
20 34 
< e e 
20 34 
° e 
20 34 


Fig. 9.1. Decomposing a line segment into two infinite rays. 


as a mild surprise. Two rational functions that represent infinite sequences 
somehow collapse, when being summed up, to a polynomial with a finite 
number of terms. We emphasize that (9.2) does not make sense on the level of 
infinite series; in fact, the two infinite series involved here have disjoint regions 
of convergence. 
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Even more magical is the geometry behind this identity: on the right-hand 
side we have a polynomial that lists the integer points in a finite interval P, 
while on the left-hand side each of the rational generating functions represents 
the integer points in an infinite ray that begins at a vertex of P. The two 
half-lines will be called vertex cones below, and indeed the remainder of this 
chapter is devoted to proving that an identity similar to (9.2) holds in general 
dimension. 

We now extend the definition of integer-point transforms o.4(z) from the 
case of cones A to the case of affine spaces A. Any such affine space A C R? 
equals w + V for some w € R? and some n-dimensional vector subspace 
V C R$, and if A contains integer points (which is the only interesting case 
for our purposes), we may choose w € Z4. The integer points VN Z? in V 
form a vector space, and hence there exists a basis v1, V2,...,Vn for VN Z?. 
This implies that any integer point m € AN Z? can be uniquely written as 


m = W + kivi + kVa +- + kEnVn for some kı, ko,...,kn E€ Z. 





Using this fixed lattice basis for V, we define the skewed orthants of A as the 
sets of the form {w + Aivi + À2V2 +--+ + ÀnVn}, where for each 1 < j < n, 
we require either A; > 0 or A; < 0. So there are 2” such skewed orthants, 
and their disjoint union equals A. We denote them by Q 1, Q2,...,OQgn. All of 
them are (half-open) pointed cones, and so their integer-point transforms are 
rational. 


Lemma 9.1. Suppose A is an n-dimensional affine space with skewed or- 
thants O1, O2,...,0an. Then as rational functions, 


Fo, (2) + G0,(%) +- + Fn (Zz) = 0. 
Proof. Suppose 
A= {w H AiVi i A2V2 fees AnVn: Ai, A2,-+-;An E R}. 





Then a typical skewed orthant O looks like 
O = {w + Avi + À2V2 +++ +AnVn: Alanin ANR > 0, Ak+1) +++ An <0}, 


and its integer-point transform is 





go(z) 
= 7 X gy1 se X gÎkVYk X gÎk+1Vk+1 Tini X gir Yn 
J120 Jk 20 Tk4+1<0 jn <0 
1 1 1 1 
— zy eae ... 
1— z”: L—2ve gveti— 1 g¥n —] 


Now consider the skewed orthant O’ with the same conditions on the A’s as in 
O except that we switch 41; > 0 to A, < 0. Then the integer-point transform 
of O' is 
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œ 1 1 1 1 1 
zı —11—z⁄2? 1— zyk zyk — 1 gZvn —]”’ 





co (z)=z 


so that oo(z) + co: (z) = 0. Since we can pair up all skewed orthants in this 
fashion, the sum of all their rational generating functions is zero. 














Since QO, U Og U- -- U Qgn is equal to A as a disjoint union, it now makes 
sense to set 
oalz):=0 (9.3) 


for any n-dimensional affine space with n > 0. Lemma 9.1 says that this defin- 
ition is not as arbitrary as it might seem, and the following result strengthens 
our motivation for the definition (9.3). 


Theorem 9.2. Given half-open pointed cones K1,K2,...,Km © R? with a 
common apex in Z? such that the disjoint union of K1, K2,...,Km is an affine 
space, then as rational functions, 


OK, (Z) + 0K, (Z) +++: + 0K,, (2) = 0. 


Proof. Suppose the disjoint union of K1,K9,...,Km is the n-dimensional 
affine space A, and w € Z? is the common apex of K1,K2,...,Km. Now 
we decompose A into the skewed orthants O1, O2,...,O2r, which are also 
pointed cones with common apex w. The intersection of one of the K;’s with 
one of the O;’s is again a half-open pointed cone, and all these cones form yet 
another disjoint union of A, which is a common refinement of the dissection 
of A in terms of the K,’s and in terms of the O;’s: 


A= |] KNO). 


1<j<m 
1<k<2” 


For each 1 < j < m, Kj = ae (Kj N Ok) as a disjoint union, and so we can 
write the integer-point transform of {Cj as 


Qn 
OK; (z) = DD OKjNOk (z) , 
k=1 
as a rational-function identity. Similarly, we obtain for each 1 < k < 2”, 


TO, (z) = > OKjNOk (z) : 


Thus 
m m 2” 2” m 2” 
X ox; (2) = 5 X ono. (2) =) So oxjno. (2) = D5 0, (2) = 0, 
j=l j=l k=1 k=1 j=1 k=1 














by Lemma 9.1. 


9.2 Tangent Cones and Their Rational Generating Functions 159 


9.2 Tangent Cones and Their Rational Generating 
Functions 


The goal of this section, apart from setting the language that allows us to prove 
Brion’s theorem, is to prove a sort of analogue of (9.2) in general dimension. 

We recall a definition that was touched on only briefly, in Exercise 3.14: 
A hyperplane arrangement %H is a finite collection of hyperplanes. An 
arrangement H is rational if all its hyperplanes are, that is, if each hy- 
perplane in H is of the form {x E RI: aya, + agry +--+ + agtg = b} for 
some @1,@2,...,@¢,0 E Z. An arrangement H is called a central hyperplane 
arrangement if its hyperplanes meet in (at least) one point. 

Our next definition generalizes (finally) the notion of a pointed cone, 
defined in Chapter 3. A convex cone is the intersection of finitely many 
half-spaces of the form {x ER? : azı +azz2 +++: + aata < b} for which the 
corresponding hyperplanes {x ER? : azı + agra +--+ adta = b} form a 
central arrangement. This definition extends that of a pointed cone: a cone 
is pointed if the defining hyperplanes meet in exactly one point. A cone is 
rational if all of its defining hyperplanes are rational. Cones and polytopes 
are special cases of polyhedra, which are convex bodies defined as the inter- 
section of finitely many half-spaces. 

We now attach a cone to each face F of P, namely its tangent cone, 
defined by 








Kr :={x+A(y- x): xXEF, y EP, AER}. 


It turns out that Ks is the smallest convex cone containing both span F and 
P. We note that Kp = span P. For a vertex v of P, the tangent cone Ky is 
often called a vertex cone; it is pointed. For a k-face F of P with k > 0, the 
tangent cone Kg is not pointed. For example, the tangent cone of an edge of 
a 3-polytope is a wedge. 


Lemma 9.3. For any face F of P, span F C KF. 


Proof. As x and y vary over all points of F, x + À (y — x) varies over span F. 














We note that this lemma implies that K+ contains a line, unless F is a 
vertex. More precisely, if K is not pointed, it contains the affine space span £F, 
which is called the apex of the tangent cone. (A pointed cone has a point as 
apex.) 

An affine space A C R equals w + V for some w € R? and some vector 
subspace V C R?. The orthogonal complement A- of this affine space A 
is defined by 

At := {xe R?: x-v=0 forallveV}. 


We note that A@ A+ = R?, which gives us the following result. 
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Lemma 9.4. For any face F of P, the tangent cone Kr has the decomposition 

Kr =spanF @ ((span F)* N Kr) k 
Consequently, unless F is a vertex, 

Ox,-(z) =0. 
Proof. Since span F @ (span F)* = R4, 
KF = (span F © (span F)") NKE 
= (span FNA KF) ((span)* N Kr) 
= spanF @ ((span)* N Kr) ; 


where the last step follows from Lemma 9.3. The second part of the lemma is 
immediate since o pan F@((span F)“nK x) (2) = span F(Z) O(span F)4aK (Z) and 


Ospan F(Z) = 0. 














Although we do not need this fact in the sequel, it’s nice to know that 
(span F)* N Kz is a pointed cone (see Exercise 9.1). 


9.3 Brion’s Theorem 


The following theorem is a classical identity of convex geometry named after 
Charles Julien Brianchon (1783-1864)! and Jørgen Pedersen Gram (1850- 
1916).? It holds for any convex polytope. However, its proof for simplices 
is considerably simpler than that for the general case. We need only the 
Brianchon—Gram identity for simplices, so we restrict ourselves to this special 
case. (One could prove the general case along similar lines as below; however, 
we would need some additional machinery not covered in this book.) The 
indicator function 1g of a set S C R?@ is defined by 


fl if xes, 
1s(x) =f if xg 8. 


Theorem 9.5 (Brianchon—Gram identity for simplices). Let A be a d- 
simplex. Then 


la(x)= > tee); 


FCA 
where the sum is taken over all nonempty faces F of A. 


1 For more information about Brainchon, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Brainchon.html. 


2 For more information about Gram, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Gram.html. 
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Proof. We distinguish between two disjoint cases: whether or not x is in the 
simplex. 


Case 1: x € A. Then x € K for all F C A, and the identity becomes 


dim A 
LS 5 (pon = 5 (a fk. 
FCA k=0 


This is the Euler relation for simplices, which we proved in Exercise 5.5. 


Case 2: x ¢ A. Then there is a unique minimal face F C A (minimal with 
respect to dimension) such that x € K and x € Kg for all faces G C A that 
contain F (Exercise 9.2). The identity to be proved is now 


o= Sep. (9.4) 
GDF 


The validity of this identity again follows from the logic of Exercise 5.5; the 
proof of (9.4) is the subject of Exercise 9.4. 














Corollary 9.6 (Brion’s theorem for simplices). Suppose A is a rational 
simplex. Then we have the following identity of rational functions: 


oa(z) = S> ox, (2). 
v a vertex of A 


Proof. We translate the Brianchon—Gram theorem into the language of inte- 
ger-point transforms: we sum both sides of the identity in Theorem 9.5 for all 


m € Z4, . 
X ta(m)z™= So SO i ik, (m) 2”, 


mEzZ4 méZ4 FCA 


which is equivalent to 


oa(z) = So (-1)9™* oq, (2): 


FCA 


But Lemma 9.4 implies that ox,(z) = 0 except when F is a vertex. Hence 


oa (z) = 5 OK, (Z) - 


v a vertex of A 














Now we extend Corollary 9.6 to any convex rational polytope: 


Theorem 9.7 (Brion’s theorem). Suppose P is a rational convex polytope. 
Then we have the following identity of rational functions: 


op(z)= Y ox,(2). (9.5) 


v a vertex of P 
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Proof. We use the same irrational trick as in the proofs of Theorems 3.12 and 
4.3. Namely we start by triangulating P into the simplices A1, Ag,..., Am 
(using no new vertices). Consider the hyperplane arrangement 


H := {span F : F is a facet of A1, Ao,..., or Am}. 


We will now shift the hyperplanes in H, obtaining a new hyperplane arrange- 
ment HS, Those hyperplanes of H that defined P now define, after shifting, 
a new polytope that we will call P**#t. Exercise 9.6 ensures that we can shift 
H in such a way that: 


e no hyperplane in H** contains any lattice point; 
Hsbift yields a triangulation of Ps"; 

e the lattice points contained in a vertex cone of P are precisely the lattice 
points contained in the corresponding vertex cone of Psi, 


This setup implies that 


the lattice points in P are precisely the lattice points in Ps; 

the lattice points in a vertex cone of Ps can be written as a disjoint 
union of lattice points in vertex cones of simplices of the triangulation that 
Hsbift induces on Ps, 


The latter two conditions, in turn, mean that Brion’s identity (9.5) follows 
from Brion’s theorem for simplices: the integer-point transforms on both sides 
of the identity can be written as a sum of integer-point transforms of simplices 
and their vertex cones. 














9.4 Brion Implies Ehrhart 


We conclude this chapter by showing that Ehrhart’s theorem (Theorem 3.23) 
for rational polytopes (which includes the integral case, Theorem 3.8) follows 
from Brion’s theorem (Theorem 9.7) in a relatively straightforward manner. 


Second proof of Theorem 3.23. As in our first proof of Ehrhart’s theorem, it 
suffices to prove Theorem 3.23 for simplices, because we can triangulate any 
polytope (using only the vertices). So suppose A is a rational d-simplex whose 
vertices have coordinates with denominator p. Our goal is to show that, for a 
fixed 0 < r < p, the function La(r + pt) is a polynomial in t; this means that 
La is a quasipolynomial with period dividing p. 

By Theorem 9.7, 


La(r+pt) = 5 1 
mé€(r+pt) ANZ4 


= lim o(r +p a(Z) (9.6) 
= lim > O(r+pt)Ky (z) . 


zl 
v vertex of A 
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We used the limit computation for the integer-point transform o(,4p1)a4 rather 
than the evaluation 0(,4p1)(1), because this evaluation would have yielded 
singularities in the rational generating functions of the vertex cones. Note that 
the vertex cones Ky are all simplicial, because A is a simplex. So suppose 


Ky = {v + àw + A2w2 +--+ + Aawa : M1, A2,--+;Ada = 0}; 
then 


(r+ pt)Ky = {(r + pt)v + Arwy + Agw2 +--+ + AqWa: Ai, A2,---,Aad > OF 
= tpv + {rv + Aiwi + A2w2 +-+- + AaWa : A1,A2,---,Aa = OF 
=tpv+rky. 








What’s important to note here is that pv is an integer vector. In particular, 
we can safely write 


O(rtpt)Ky (Z) = zPY orn, (Z) 


(we say “safely” because tpv € Zf, so z’?Y is indeed a monomial). Now we 
can rewrite (9.6) as 


La(r + pt) = lim 5 zPY opg, (z). (9.7) 
aet v vertex of A 
The exact forms of the rational functions orx, (zZ) is not important, except for 
the fact that they do not depend on t. We know that the sum of the generating 
functions of all vertex cones is a polynomial in z; that is, the singularities of 
the rational functions cancel. To compute La(r + pt) from (9.7), we write 
all the rational functions on the right-hand side over one denominator and 
use L’Hôpital’s rule to compute the limit of this one huge rational function. 
The variable t appears only in the simple monomials z‘*?Y, so the effect of 
L’Hopital’s rule is that we obtain linear factors of t every time we differentiate 
the numerator of this rational function. At the end we evaluate the remaining 
rational function at z = 1. The result is a polynomial in t. 














Notes 


1. Theorem 9.5 (in its general form for convex polytopes) has an interesting 
history. In 1837 Charles Brianchon proved a version of this theorem involving 
volumes of polytopes in R? [44]. In 1874 Jørgen Gram gave a proof of the same 
result [88]; apparently he was unaware of Brianchon’s paper. In 1927 Duncan 
Sommerville published a proof for general d [167], which was corrected in the 
1960s by Victor Klee [111], Branko Griinbaum [90, Section 14.1], and many 
others. 


2. Michel Brion discovered Theorem 9.7 in 1988 [45]. His proof involved the 
Baum—Fulton—Quart Riemann—Roch formula for equivariant K-theory of toric 
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varieties. A more elementary proof of Theorem 9.7 was found by Masa-Nori 
Ishida a few years later [103]. Our approach in this chapter follows [27]. 


3. As we have already remarked earlier, Brion’s theorem led to an efficient 
algorithm by Alexander Barvinok to compute Ehrhart quasipolynomials [15]. 
More precisely, Barvinok proved that in fixed dimension, one can efficiently? 
compute the Ehrhart series }>,.) Lp(t)z* as a short sum of rational func- 
tions. Brion’s theorem essentially reduces the problem to computing the 
integer-point transforms of the rational tangent cones of the polytope. Barvi- 
nok’s ingenious idea was to use a signed decomposition of a rational cone to 
compute its integer-point transform: the cone is written as a sum and differ- 
ence of unimodular cones, which we will encounter in Section 10.4 and which 
have a trivial integer-point transform. Finding a signed decomposition involves 
triangulations, Minkowski’s theorem on lattice points in convex bodies (see, 
for example, [57, 133, 140, 163]), and the LLL algorithm which finds a short 
vector in a lattice [121]. At any rate, Barvinok proved that one can find a 
signed decomposition quickly, which is the main step towards computing the 
Ehrhart series of the polytope. Barvinok’s algorithm has been implemented 
in the software packages barvinok [185] and LattE [66, 67, 115]. Barvinok’s 
algorithm is described in detail in [13]. 


Exercises 


9.1. & Prove that for any face F of a polytope, (span Fy" N KF is a pointed 
cone. (Hint: Show that if H is a defining hyperplane for F, then HN(span F) 
is a hyperplane in the vector space (span F j=j 


9.2. & Suppose A is a simplex and x ¢ A. Prove that there is a unique 
minimal face F C A (minimal with respect to dimension) such that the corre- 
sponding tangent cone Kr contains x. Show that x € Kg for all faces GC A 
that contain F, and x ¢ Kg for all other faces G. 


9.3. Show that Exercise 9.2 fails to be true if A is a quadrilateral (for exam- 
ple). Show that the Brianchon—Gram identity holds for your quadrilateral. 


9.4. & Prove (9.4): for a face F of a simplex A, 


5 (—1)"m9 = 0, 


GDF 
where the sum is taken over all faces of A that contain F. 


3 “Efficiently” here means that for every dimension, there exists a polynomial that 
gives an upper bound on the running time of the algorithm, when evaluated at 
the logarithm of the input data of the polytope (e.g., its vertices). 

4 “Short” means that the set of data needed to output this sum of rational functions 
is also of polynomial size in the logarithm of the input data of the polytope. 
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9.5. Give a direct proof of Brion’s theorem for the 1-dimensional case. 


9.6. & Provide the details of the irrational-shift argument in the proof of 
Theorem 9.7: Given a rational polytope P, triangulate it into the simplices 
Ay, Ag,..., Am (using no new vertices). Consider the hyperplane arrangement 


H := {span F : F is a facet of A1, Ao,..., Am}. 


We will now shift the hyperplanes in H, obtaining a new hyperplane arrange- 
ment HS, Those hyperplanes of H that defined P now define, after shifting, 
a new polytope that we will call Pf. Prove that we can shift H in such a 
way that: 


no hyperplane in H*#* contains any lattice point; 

Hsbift yields a triangulation of Ps"; 

the lattice points contained in a vertex cone of P are precisely the lattice 
points contained in the corresponding vertex cone of Pst, 


9.7. & Prove the following “open polytope” analogue for Brion’s theorem: If 
P is a rational convex polytope, then we have the identity of rational functions 


ope(z)= > — oxe(z). 


v a vertex of P 
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Euler—Maclaurin Summation in R 


All means (even continuous) sanctify the discrete end. 


Doron Zeilberger 


Thus far we have often been concerned with the difference between the dis- 
crete volume of a polytope P and its continuous volume. In other words, the 


quantity 
5 1- f dy, (10.1) 
P 


mePnZ4 
which is by definition Lp(1) — vol(P), has been on our minds for a long time 
and has arisen naturally in many different contexts. An important extension is 
the difference between the discrete integer-point transform and its continuous 


sibling: 

5 e™*— f e”*¥dy, (10.2) 

mePnZ4 n 

where we have replaced the variable z that we have commonly used in generat- 
ing functions by the exponential variable (21, Z2,..., Z4) = (e%1,e72,..., e74). 
Note that upon setting x = 0 in (10.2) we get the former quantity (10.1). Re- 
lations between the two quantities } ` mepnga e™™* and J eY'*¥dy are known as 
Euler—Maclaurin summation formulas for polytopes. The “behind-the-scenes” 
operators that are responsible for affording us with such connections are the 
differential operators known as Todd operators, whose definition utilizes the 
Bernoulli numbers in a surprising way. 


10.1 Todd Operators and Bernoulli Numbers 


Recall the Bernoulli numbers B; from Section 2.4, defined by the generating 
function 
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We now introduce a differential operator via essentially the same generating 
function, namely 


k 
Todd), := 1 pen (=) i (10.3) 


k>1 


This Todd operator is often abbreviated as 


d 
Todd, = eee 
l—e a 
but we should keep in mind that this is only a shorthand notation for the 
infinite series (10.3). We first show that the exponential function is an eigen- 


function of the Todd operator. 


Lemma 10.1. For z € C \ {0} with |z| < 27, 


z ezh 


Todd), e7” = f 
1 — e7? 





Proof. 


B, / aN" 
Toddp e7” = Dee (=) er 


k>0 


Bk s 
= PD ee ern: x 
k>0 i 
P Br 
=e Gea, 
k>0 
zh * 


=e ; 
e 7-1 














The condition |z| < 27 is needed in the last step, by Exercise 2.14. 





The Todd operator is a discretizing operator, in the sense that it transforms 
a continuous integral into a discrete sum, as the following theorem shows. 


Theorem 10.2 (Euler—Maclaurin in dimension 1). For alla << beEZ 
and z E€ C with |z| < 27, 


b+hı 
Todd», Todd», f edz 


a—hg 





b 
= > Ez, 
k=a 
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Proof. Case 1: z = 0. Then e** = 1, and so 

















b+hi 
Todd», Todd, f edz 
a—h2 hı=h2=0 
b+hi 
= Todda, Toddh, f dx 
a—he hi=h2=0 
=b-at Todda, hy Th Todd}, hal, =ho=0 
=b-a+hi4 : + he 4 : 
2 2 lay =ho=0 





=b-a+l1 


by Exercise 10.1. Since Eer e10 = b— a+ 1, weve verified the theorem in 
this case. 


Case 2: z #0. Then 


b+hı 
1 
Todd, Toddn, 1 e7" de = Toddy, Toddn, = (Cae = geta) 


a—h2 











1 
= — (Todd), e7°**"" — Todd), e7**") 
z 
e”? za 
= — Todd», e _ — Todda, e74 
ez z eži e%4 —z eo zha 
= gz l—-e- z l-e ’ 
where the last step follows from Lemma 10.1. Hence 
b+hi 1 1 
Todda, Toddp, f edax =e ak 
a—he 1 — e77 1 — e” 
hi=h2=0 





e2(o+1) — era 


e* —1 














We will need a similar multivariate version of the Todd operator later, so 
that we define for h = (hy, ho,...,hm), 


m 


OL 
Todd, := [| oo en 
j=l 1- exp (-%) 


keeping in mind that this is a product over infinite series of the form (10.3). 
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10.2 A Continuous Version of Brion’s Theorem 


In the following two sections, we develop the tools that, once fused with the 
Todd operator, will enable us to extend Euler—Maclaurin summation to higher 
dimensions. A lemma, which is of independent interest, but will be used in 
the proof of the continous version of Brion’s theorem, now follows. 


Lemma 10.3. Suppose W1, W2,..., Wq € Z? are linearly independent, and let 
TI = {Aywi + Agwe +--+ + Aawa: 0 < Ai, A2,---,Ad < 1}. 


Then 
# (ILA Z*) = vol II = |det (wi,...,wa)| 


and for any positive integer t, 
# (tN Z*) = (vol II) t*. 


In other words, for the half-open parallelepiped I, the discrete volume 
# (tN Z?) coincides with the continuous volume (vol IT) ¢*. 


Proof. Because II is half open, we can tile the tt! dilate tII by tt translates 
of II, and hence 


In(t) = # (tN Z?) = # (IN Z?) tt. 


On the other hand, by the results of Chapter 3, Ly(t) is a polynomial with 
leading coefficient vol II = |det (wi,...,wa)|. Since we have equality of these 
polynomials for all positive integers t, 





# (IIN Z*) = vol IL. 











We now give an integral analogue of Theorem 9.7 for simple rational poly- 
topes. We start by translating Brion’s integer-point transforms 


op(z) = >> ox, (2) 
v a vertex of P 
into an exponential form: 
op(expz) = 5 oç, (expzZ). 
v a vertex of P 


For the continuous analogue of Brion’s theorem, we replace the sum on the 
left-hand side, 


op(expZ) = 5 (expz)™ = 5 exp(m-2z) , 


meEPnNZ4 meEPNZ4 


by an integral. 
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Theorem 10.4 (Brion’s theorem: continuous form). Suppose P is a 
simple rational convex d-polytope. For a vertex cone Ky of P, fix a set of 





generators w1(v),Wo(v),...,wa(v) € Zt. Then 
ey eee ae exp (v - z) |det (wi(v),...,wa(v))| 
J ° a ) l va 2 of P Ii- (wk (v) ` Z) 


for all z such that the denominators on the right-hand side do not vanish. 


Proof. We start with the assumption that P is an integral polytope; we will 
see in the process of the proof that this assumption can be relaxed. Let’s 
write out the exponential form of Brion’s theorem (Theorem 9.7), using the 
assumption that the vertex cones are simplicial (because P is simple). By 
Theorem 3.5, 





pD e ET E F exp(v - Zz) on, (exp Z) (10.4) 


mEPNZI v a vertex of P Ik= (1 — exp (wk (v) ` z)) 
where 
v = {Arwi(v) + à2w2(v) +--+ + Agwa(v) : 0 < Aq, Ag,---,Aa < 1} 


is the fundamental parallelepiped of the vertex cone Ky. We would like to 
rewrite (10.4) with the lattice Z? replaced by the refined lattice (22)%, be- 
cause then the left-hand side of (10.4) will give rise to the sought-after integral 
by letting n approach infinity. The right-hand side of (10.4) changes accord- 
ingly; now every integral point has to be scaled down by i; 





y exp(m+z) = ys en `Z) X mensogi — Ge : z) , 
mePn(2Z)* v a vertex of P Ik= (1 — exp (x0 i z)) 
(10.5) 
The proof of this identity is in essence the same as that of Theorem 3.5; we 
leave it as Exercise 10.2. Now our sought-after integral is 


f owt: z)dx = lim = 5 exp(m- z) 


n— o0 ne 
mePn(4Z)* 


z) 





= Tim i pD exp(v : z) Smet NZa ou 


~ n>œ nd w 
He v a vertex of P Ik- I (1 — exp (=> 1 din)) 
(10.6) 


At this point we can see that our assumption that P has integral vertices can 
be relaxed to the rational case, since we may compute the limit only for n’s 
that are multiples of the denominator of P. The numerators of the terms on 
the right-hand side have a simple limit: 
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jim, exp(v - z) 5 exp (= : z) = exp(v- Z) 5 1 


meéll,nZ4 meéll,nZ4 


= exp (v +z) |det (wi(v),...,walv))|, 
where the last identity follows from Lemma 10.3. Hence (10.6) simplifies to 


exp (v - z) |det (wi(v),-.-,Wa(v))| 
v a vertex of P Mi limno n (1 — exp (=~ : z)) 





IESE -Z)dx = 


Finally, using L’Hopital’s rule, we have 


ee res O ec 


and the theorem follows. 














It turns out (Exercise 10.4) that for each vertex cone Ky, 


aexD (v - z) |det (wi(v),...,wa(v))| 
Tins (we (v) -2) 


and Theorem 10.4 shows that the Fourier—Laplace transform of P equals the 
sum of the Fourier—Laplace transforms of the vertex cones. In other words, 


I exp(x : Z) dx = 5 J exp(x- Z) dx. 


v a vertex of P 





f exp(x -z) dx = (—1) : (10.7) 
Ky 


We also remark that |det (w)(v),...,wa(v))| has a geometric meaning: it 
is the volume of the fundamental parallelepiped of the vertex cone Ky. 

The curious reader might wonder what happens to the statement of The- 
orem 10.4 if we scale each of the generators w;,(v) by a different factor. It is 
immediate (Exercise 10.5) that the right-hand side of Theorem 10.4 remains 
invariant. 


10.3 Polytopes Have Their Moments 


The most common notion for moments of a set P C R? is 


Ha = f ytay =f up yg dy 
P P 


for any fixed vector a = (a1,a2,..., aq) € C4. For a = 0 = (0,0,...,0), we 
get uo = vol P. As an application of moments, consider the problem of finding 
the center of mass of P, which is defined by 


1 
d dy,... dy}. 
vol P (fw yo fa y., fu y) 
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This integral is equal to 


za (1(1,0,0,...,0) » £4(0,1,0,...,0) tee , H(0,...,0,1)) : 
Ho 
Similarly, one can define the variance of P and other statistical data attached 
to P and use moments to compute them. 
Our next task is to present the moments Ha in terms of Theorem 10.4. We 
make the change of variables y = e** in the defining integral of ua: 


Ha = f y®dy = if e7171 e7202... edd e71 672... e7d dx = | et) dy, 
P P P 


Thus Theorem 10.4 gives us the following formulas for the moments of a simple 
rational d-polytope P: 


a= (1) exp (v - (a + 1)) |det (wi(v),...,wa(v))| 
i l l v pa of P Mi= (w(v) ý (a+ 1)) 





9 


for all a such that the denominators on the right-hand side do not vanish. 
Going a step further, we can use Theorem 10.4 to obtain information for a 
different set of moments. Along the way, we stumble on an amazing formula 
for the continuous volume of a polytope. 


Theorem 10.5. Suppose P is a simple rational convex d-polytope. For a ver- 
ter cone Ky of P, fix a set of generators wi(v), wo(v),...,wa(v) € Zt. Then 


vol P = 





(-1)4 3 (v - z)? |det (wi(v),...,wa(v))| 


! d 
d! v a vertex of P Lk- (Wi (v) ` Z) 


for all z such that the denominators on the right-hand side do not vanish. 
More generally, for any integer j > 0, 








e a (v +z)’ ** |det (wi(v),..., wa(v))| 
f ya X 


(G + d)! v a vertex of P Ik= (wk (v) . z) 


Proof. We replace the variable z in the identity of Theorem 10.4 by sz, where 
s is a scalar: 





? 


xp (x - (sz)) dx = (—1)4 exp (v - (sz)) |det (wi(v),..., wa(v))| 
[owe aa E 


v a vertex of P Le (wk (v) ` (sz)) 


which can be rewritten as 


otie aidez d exp (s (v - z)) |det (w1 (v), ..., wa(v))| 
E ED 


v a vertex of P sf Mi= (wk (v) : (z)) 
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The general statement of the corollary follows now by first expanding the 
exponential functions as Taylor series in s, and then comparing coefficients 
on both sides: 


E 


, sJ—4 |de Wi1(V),...,WalVv 
ie S Tea See a) 


a a 
v a vertex of P j>0 J H=: (w:(v) A (z)) 




















-z))** |det (wi(v),...,Wa(v sf 
2S (ee: Se (v-z) |det (wi(v) d(v))| 


j>—d v a vertex of P Ths (wz(v) ` (z)) (j + d)! i 


The proof of this corollary reveals yet more identities between rational 
functions. Namely, the coefficients of the negative powers of s in the last line 
of the proof have to be zero. This immediately yields the following curious set 
of d identities for simple d-polytopes: 


Corollary 10.6. Suppose P is a simple rational convex d-polytope. For a ver- 
tex cone Ky of P, fix a set of generators w1 (v), w2(v),...,wa(v) € Zt. Then 
forO<j<d-1, 

















D (v: z) |det (wi(v),..-,walv))I o. 


v a vertex of P Ik= (wk (v) ; (z)) 


10.4 From the Continuous to the Discrete Volume 
of a Polytope 


In this section, we apply the Todd operator to a perturbation of the continuous 
volume. Namely, consider a simple full-dimensional polytope P, which we may 
write as 


P={xeR’: Ax<b}. 
Then we define the perturbed polytope 





P(h):= {xe R?: Ax<b+h} 


for a “small” vector h € R™. A famous theorem due to Askold Khovanskii and 
Aleksandr Pukhlikov says that the integer-point count in P can be obtained 
from applying the Todd operator to vol (P(h)). Here we prove the theorem 
for a certain class of polytopes, which we need to define first. 

We call a rational pointed d-cone unimodular if its generators are a basis 
of Zt. An integral polytope is unimodular if each of its vertex cones is 
unimodular.? 


1 Unimodular polytopes go by two additional names, namely smooth and 
Delzant. 
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Theorem 10.7 (Khovanskii—Pukhlikov theorem). For a unimodular d- 
polytope P, 
# (P N Z?) = Todda vol (P(h))|y~0 - 


More generally, 


op(expz) = Todds | exp(x - Zz) dx 
P(h) 





h=0 
Proof. We use Theorem 10.4, the continuous version of Brion’s theorem; note 
that if P is unimodular, then P is automatically simple. For a vertex cone 
Ky of P, denote its generators by wi(v), wo(v),...,wa(v) € Zt. Then The- 
orem 10.4 states that 

exp (v - z) [det (wi(v),..-,wav))| 

d 

v a vertex of P pat (Wi (v) ` z) 


=) SO eSB. (10.8) 


d 
v a vertex of P Ik= (wk (v) ; Z) 





| exp(x : z) dx = (—1)¢ 
P 





where the last identity follows from Exercise 10.3. A similar formula holds 
for P(h), except that we have to account for the shift of the vertices. The 
vector h shifts the facet-defining hyperplanes. This shift of the facets induces 
a shift of the vertices; let’s say that the vertex v gets moved along each edge 
direction w; (the vectors that generate the vertex cone Ky) by hz (v), so that 
P(h) has now the vertex v — Si hk(v)wų(v). If h is small enough, P(h) 
will still be simple,? and we can apply Theorem 10.4 to P(h): 


exp ((v — D hu(v)we(v)) . z) 





f exp(x : z) dx = (—1)¢ 5 
P(h) 








v a vertex of P Ik= (wk (v) : z) 
= (—1)? ea (v -z — pay ha (v)we(v) z) 
7 v a vertex of P Ik= (wk (v) i Z) 
d 
wee exp(v : z) [Ty exp (~hulv)welv) 2) 
va 2 of P Ik= (wx(v) i Z) 


Strictly speaking, this formula holds only for h € Q™, so that the vertices 
of P(h) are rational. Since we will eventually set h = 0, this is a harmless 
restriction. Now we apply the Todd operator: 


? The cautious reader may consult [193, p. 66] to confirm this fact. 
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Toddy f exp(x : Z) dx 
P(h) 





h=0 
d 
oa rota, PCW) TIE exp (—he(wywa(v) 2) 
v 2 of P i Tes (wk (v) ` Z) 
= (-1)4 ss exp(v - z) 


d 
v vertex of P rat (wx(v) : Z) 
d 


x |] Todda, c) exp (—he(v)we(v) z) 
k=1 








h=0 








hr (v)=0 


By a multivariate version of Lemma 10.1, 


Toddy f exp(x - z) dx 
P(h) 





h=0 


d 
= lag 5y exp (v i z) I] —wy(v) ‘2 


v vertex of P Ii- (wz(v) j z) k=1 Ls exp(w;,.(v) k z) 


d 
1 
= » awa l ax 


v vertex of P Wk (v) ' z) 








However, Brion’s theorem (Theorem 9.7), together with the fact that P is 
unimodular, says that the right-hand side of this last formula is precisely the 
integer-point transform of P (see also (10.8)): 


Toddy f exp(x - z) dx 
P(h) 





Finally, setting z = 0 gives 


Toddnp | dx 
P(h) 





h=0 meEPNZ4 











as claimed. 





We note that Jow exp(x - z) dx is, by definition, the continuous Fourier- 
Laplace transform of P(h). Upon being acted on by the discretizing operator 
Toddn, fp (h) exp(x-z) dx gives us the discrete integer-point transform op (z). 


Notes 


1. The classical Euler-Maclaurin formula states that 
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z 5 f0) + f(n) D Bom [2m1 $ 
LE A a E a Pan) 


1 n 

B (2p+1) (7) q 
+ apa |, Pen (lah) £0 (@) de, 
where B;,(x) denotes the kt? Bernoulli polynomial. It was discovered indepen- 
dently by Leonhard Euler and Colin Maclaurin (1698-1746). This formula 
provides an explicit error term, whereas Theorem 10.2 provides a summation 
formula with no error term. 


2. The Todd operator was introduced by Friedrich Hirzebruch in the 1950s 
[100], following a more complicated definition by J. A. Todd [181, 182] some 
twenty years earlier. The Khovanskii-Pukhlikov theorem (Theorem 10.7) 
can be interpreted as a combinatorial analogue of the algebro-geometric 
Hirzebruch—Riemann—Roch theorem, in which the Todd operator plays a 
prominent role. 


3. Theorem 10.4, the continuous form of Brion’s theorem, was generalized by 
Alexander Barvinok to any polytope [11]. In fact, [11] contains a certain ex- 
tension of Brion’s theorem to irrational polytopes as well. The decomposition 
formula for moments of a polytope in Theorem 10.5 is due to Michel Brion 
and Michéle Vergne [46]. 


4. Theorem 10.7 was first proved in 1992 by Askold Khovanskii and Alek- 
sandr Pukhlikov [108]. The proof we give here is essentially theirs. Their paper 
[108] also draws parallels between toric varieties and lattice polytopes. Subse- 
quently, many attempts to provide formulas for Ehrhart quasipolynomials— 
some based on Theorem 10.7—have provided fertile ground for deeper con- 
nections and future work; a long but by no means complete list of references 
is [9, 33, 46, 55, 60, 61, 76, 91, 106, 107, 119, 125, 137, 146, 178]. 


Exercises 


10.1. & Show that Todd, h = h + Ł, More generally, prove that Todda hk = 
By(h + 1) for k > 1, where B(x) denotes the kt! Bernoulli polynomial. 


10.2. & Prove (10.5): Suppose P is a simple integral d-polytope. For a vertex 
cone Ky of P, denote its generators by wi(v), wa(v),-..,wa(v) € Z? and its 
fundamental parallelepiped by Iy. Then 





5 exp(m-z)= D exp(v z) Smentynas Op (ii) 


d . 
mePn(4zZ)* v a vertex of P IS (1- exp (=m -z)) 


3 For more information about Maclaurin, see 
http://www-groups.dcs.st-and.ac.uk/~history/Biographies/Maclaurin.html. 
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10.3. & Given a unimodular cone 
K = {v + wi + Aowe +--+ + Aawa : A1, A2,+--,Ad = 0}, 


where v, w1, W2,..., Wq € Z® such that w1, w2,..., wq are a basis for Z?, 
show that 


ZY 


I, a-z) 


ox (Z) 


and |det (wi,...,Wa)| = 1. 
10.4. & Prove (10.7). That is, for the simple cone 
d 
K= {y+ ode : Àk >0] 
k=1 


with v, w1, W2, ..., Wq E Q%, show that 





a €Xp (v - z) |det (w1 (v), ..., wa(v))| l 
Tins (we(v) -2) 


10.5. Show that in the statement of Theorem 10.4, the expression 


f exp(x -z) dx = (—1) 
K 


Idet (wi (v),..-,wa(v))| 
Ik- (we(v) - 2) 


remains invariant upon scaling each w;(v) by an independent positive integer. 





Open Problems 


10.6. Find all differentiable eigenfunctions of the Todd operator. 


10.7. Classify all polytopes whose discrete and continuous volumes coincide, 
that is, Lp(1) = vol P. 


10.8. Which integer polytopes have a triangulation into d-simplices such that 
each of the simplices is unimodular? 
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Solid Angles 


Everything you’ve learned in school as “obvious” becomes less and less obvious as 
you begin to study the universe. For example, there are no solids in the universe. 
There’s not even a suggestion of a solid. There are no absolute continuums. There 
are no surfaces. There are no straight lines. 


Buckminster Fuller (1895-1983) 


The natural generalization of a two-dimensional angle to higher dimensions is 
called a solid angle. Given a pointed cone K C R%, the solid angle at its apex 
is the proportion of space that the cone K occupies. In slightly different words, 
if we pick a point x € R? “at random,” then the probability that x € K is 
precisely the solid angle at the apex of K. Yet another view of solid angles is 
that they are in fact volumes of spherical polytopes: the region of intersection 
of a cone with a sphere. There is a theory here that parallels the Ehrhart 
theory of Chapters 3 and 4, but which has some genuinely new ideas. 


11.1 A New Discrete Volume Using Solid Angles 


Suppose P C Rê is a convex rational d-polyhedron. The solid angle wp(x) 
of a point x (with respect to P) is a real number equal to the proportion of a 
small ball centered at x that is contained in P. That is, we let B.(x) denote 
the ball of radius € centered at x and define 


vol (B(x) N P) 


EV) vol B(x) 


for all positive e sufficiently small. We note that when x ¢ P, wp(x) = 0; 
when x € P°, wp(x) = 1; when x € OP, 0 < wp(x) < 1. The solid angle 
of a face F of P is defined by picking any point x in the relative interior F° 
and setting wp(F) = wp(x). 
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Example 11.1. We compute the solid angles of the faces belonging to the 
standard 3-simplex A = conv {(0,0,0), (1,0, 0), (0, 1,0), (0,0, 1)}. As we just 
mentioned, a point interior to A has solid angle 1. Every facet has solid angle 
5 (and this remains true for any polytope). 

The story gets interesting with the edges: here we are computing dihedral 
angles. The dihedral angle of any one-dimensional edge is defined by the 
angle between the outward-pointing normal to one of its defining facets and 
the inward-pointing normal to its other defining facet. 


Each of the edges OA, OB, and OC in the above figure have the same 
solid angle L, Turning to the edge AB, we compute the angle between its 


defining facets as follows: 


cos”! (= (—1,—1,—1) - (0,0, -1)) = cos + (=) 


The edges AC and BC have the same solid angle by symmetry. 

Finally, we compute the solid angle of the vertices: the origin has solid 
angle z, and the other three vertices all have the same solid angle w. With 
Corollary 11.9 below (the Brianchon-Gram relation), we can compute this 


angle via 





; 1 1 1 1 
0= X (-1)9™%wp(F) = -14+4-5-3- 3- cos? | —=] += +3: 
a ) wp(F) 5 i cos (=) + gt w, 





š $ DO e O E A E S 
which gives w = cos ( 4) z 











We now introduce another measure of discrete volume; namely, we let 
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Ap(t):= XO wp), 


metPnZ4 


the sum of the solid angles at all integer points in tP; recalling that wp(x) = 0 
if x ¢ P, we can also write 


Ap(t)= X` wp(m). 
meZ4 


This new discrete volume measure differs in a substantial way from the 
Ehrhart counting function Lp(t). Namely, suppose P is a d-polytope that can 
be written as the union of the polytopes Pı and P2 such that dim (P1 N P2) < 
d, that is, Pı and P2 are glued along a lower-dimensional subset. Then at each 
lattice point m € Zf, wp, (m) + wp, (m) = wp(m), and so the function Ap 
has an additive property: 


Ap(t) = Ap, (t) + Ap, (t). (11.1) 
In contrast, the Ehrhart counting functions satisfy 
Lp (t) = Lp, (t) + Lp, (t) = LPNP: (t) : 


On the other hand, we can transfer computational effort from the Ehrhart 
counting functions to the solid-angle sum and vice versa, with the use of the 
following lemma. 


Lemma 11.2. Let P be a polytope. Then 


Ap(t) = X` wp(F) Lro(t). 


FOP 


Proof. The dilated polytope tP is the disjoint union of its relative open faces 
tF°, so that we can write 


Ap(t)= X` wp(m)= ` S > upm) lige (m). 


mEZ4 FCP meZ4 


But wp(m) is constant on each relatively open face tF°, and we called this 
constant wp(F), whence 


Ap(t) = X we(F) X lige(m) = X wp(F) Lro(t). 


FCP meZ¢ FCP 














Thus Ap(t) is a polynomial (respectively quasipolynomial) in ¢ for an 
integral (respectively rational) polytope P. We claim that Lemma 11.2 is in 
fact useful in practice. To drive the point home, we illustrate this identity by 
computing the solid-angle sum over all integer points of A in Example 11.1. 
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Example 11.3. We continue the solid-angle computation for the 3-simplex 
A = conv { (0, 0,0), (1, 0,0), (0,1, 0), (0,0, 1)}. We recall from Section 2.3 that 
Lao = Cai The facets of A are three standard triangles and one triangle 
that appeared in the context of the Frobenius problem. All four facets have 
the same interior Ehrhart polynomial ere A similar phenomenon holds for 
the edges of A: all six of them have the same interior Ehrhart polynomial 
t — 1. These polynomials add up, by Lemma 11.2 and Example 11.1, to the 


solid-angle sum 


Aa(t) = COETS + (3-5 +3: com (=) )@-» 
shee Gh) 


ol} f1 5 
mae + (Seo (=) aye 


The magic cancellation of the even terms of this polynomial is not a coinci- 
dence, as we will discover in Theorem 11.7. The curious reader may notice 
that the coefficient of t in this example is not a rational number, in stark 
contrast with Ehrhart polynomials. 














The analogue of Ehrhart’s theorem (Theorem 3.23) in the world of solid 
angles is as follows. 


Theorem 11.4 (Macdonald’s theorem). Suppose P is a rational convex 
d-polytope. Then Ap is a quasipolynomial of degree d whose leading coefficient 
is volP and whose period divides the denominator of P. 


Proof. The denominator of a face F C P divides the denominator of P, and 
hence so does the period of L, by Ehrhart’s theorem (Theorem 3.23). By 
Lemma 11.2, Ap is a quasipolynomial with period dividing the denominator 
of P. The leading term of Ap equals the leading term of Lpo, which is vol P, 
by Corollary 3.20 and its extension in Exercise 3.29. 














11.2 Solid-Angle Generating Functions 
and a Brion-Type Theorem 


By analogy with the integer-point transform of a polyhedron P C Rĉ, which 
lists all lattice points in P, we form the solid-angle generating function 


ap(z):= > wp(m)z™. 
mePnZ4 


Using the same reasoning as in (11.1) for Ap, this function satisfies a nice 
additivity relation. Namely, if the d-polyhedron P equals Pı U P2, where 
dim (P1 N P2) < d, then 
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ap(z) = ap, (2) + ap, (2). (11.2) 


This generating function obeys the following reciprocity relation, which par- 
allels both the statement and proof of Theorem 4.3: 


Theorem 11.5. Suppose K is a rational pointed d-cone K with the origin as 
apex, and v € RÌ. Then the solid-angle generating function ay+x(z) of the 
pointed d-cone v + K is a rational function that satisfies 


avse (2) = (-1)favar (0). 


Proof. Because solid angles are additive by (11.2), it suffices to prove this the- 
orem for simplicial cones. The proof for this case proceeds along the same lines 
as the proof of Theorem 4.2; the main geometric ingredient is Exercise 4.2. 
We invite the reader to finish the proof (Exercise 11.4). 














The analogue of Brion’s theorem in terms of solid angles is as follows. 


Theorem 11.6. Suppose P is a rational convex polytope. Then we have the 
following identity of rational functions: 


ap(z) = 5 aç, (z). 
v a vertex of P 
Proof. As in the proof of Theorem 9.7, it suffices to prove Theorem 11.6 for 
simplices. So let A be a rational simplex. We write A as the disjoint union of 
its open faces and use Brion’s theorem for open polytopes (Exercise 9.7) on 
each face. That is, if we denote the vertex cone of F at vertex v by Ky (F), 
then by a monomial version of Lemma 11.2, 


aa (z) = X. wa(F) oFe(2) 


FCA 


= So oaz So waAlF) So oxympl(z), 


v a vertex of A FCA v a vertex of F 
dim F>0 


where we used Brion’s theorem for open polytopes (Exercise 9.7) in the second 
step. By Exercise 11.5, 


Y aA SO owel > XO wk (F)or(z), 


FCA v a vertex of F v a vertex of A FCKy 
dim F>0 dim F>0 
and so 
v 
aaz) = E wawt E E a, (Fore(2) 
v a vertex of A v a vertex of A FCKy 
dim F>0 


= 5 XO we, (F) oe (2) 


v a vertex of A FCKy, 


= 5 aK, (z). 


v a vertex of A 
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11.3 Solid-Angle Reciprocity and the Brianchon—Gram 
Relations 


With the help of Theorems 11.5 and 11.6, we can now prove the solid-angle 
analogue of Ehrhart—Macdonald reciprocity (Theorem 4.1): 


Theorem 11.7 (Macdonald’s reciprocity theorem). Suppose P is a ra- 
tional convex polytope. Then the quasipolynomial Ap satisfies 


Ap(—t) = (-1)%™? Ap(t). 


Proof. We give the proof for an integral polytope P and invite the reader to 
generalize it to the rational case. The solid-angle counting function of P can 
be computed through the generating function: 


Ap(t) = ap (1, 1, tees 1) _ lim arp (Z) . 
By Theorem 11.6, 


Ap(t) = lim ark, (Z), 
a oe ag 

where Ky is the tangent cone of P at the vertex v. We write Ky =v +K(v), 
where K(v) := Ky — v is a rational cone with the origin as its apex. Then 
tk, = tv + K(v) because a cone whose apex is the origin does not change 
under dilation. Hence we obtain, with the help of Exercise 11.3, 


Apt) =lm ows) = Him J,  z“acw(2). 
v a vertex of P v a vertex of P 


The rational functions axy)(z) on the right-hand side do not depend on t. If 
we think of the sum over all vertices as one big rational function, to which we 
apply L’H6pital’s rule to compute the limit as z — 1, this gives an alternative 
proof that Ap(t) is a polynomial, in line with our proof for the polynomiality 
of Lp(t) in Section 9.4. At the same time, this means we can view the identity 


Ap(t) = lim za Z 
P l ee oe K 

in a purely algebraic fashion: on the left-hand side we have a polynomial that 

makes sense for any complex t, and on the right-hand side we have a rational 

function of z, whose limit we compute, for example, by L’Hopital’s rule. So 

the right-hand side, as a function of t, makes sense for any integer t. Hence 

we have the algebraic relation, for integral t, 


ee riji —tv 
Ap(—t) = lim > jae aK (wv) (z). 
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But now by Theorem 11.5, axy)(z) = (—1)4axyyy (4), and so 


1 
: —tv d 
Ap(-t) = jim 2 ae t (—1) QK(v) G) 


sra, EC) ma) 


v a vertex of P 


S 1i 
= (—1)* lim 5 Qtv+K(v) G) 
v a vertex of P 


= D aee (3) 
= (-1)*Ap(t). 


In the third step we used Exercise 11.3 again. 

This proves Theorem 11.7 for integral polytopes. The proof for rational 
polytopes follows along the same lines; one deals with rational vertices in the 
same manner as in our second proof of Ehrhart’s theorem in Section 9.4. We 
invite the reader to finish the details in Exercise 11.6. 











We remark that throughout the proof, we cannot simply take the limit 
inside the finite sum over the vertices of P, since z = 1 is a pole of each rational 
function ax(v). It is precisely the magic of Brion’s theorem that makes these 
poles cancel each other, to yield Ap(t). 

If P is an integral polytope, then Ap is a polynomial, and Theorem 11.7 
tells us that Ap is always even or odd: 





Ap (t) = Cd i? + ep ett tere +eg. 
We can say more. 
Theorem 11.8. Suppose P is a rational convex polytope. Then Ap(0) = 0. 


This is a meaningful zero. We note that the constant term of Ap is given by 


Ap(0) = X we(F) Lro(0) = $ wp(F)(-)%™*, 


FCP FCP 


by Lemma 11.2 and Ehrhart-Macdonald reciprocity (Theorem 4.1). Hence 
Theorem 11.8 implies a classical and useful geometric identity: 


Corollary 11.9 (Brianchon—Gram relation). For a rational convex poly- 
tope P, 
> (-1)9" wp (F) = 0. 


FCP 
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Example 11.10. Consider a triangle 7 in R? with vertices vj,v2,v3 and 
edges FE, E2, E3. The Brianchon—Gram relation tells us that for this triangle, 


WT (v1) + wy (v2) + Wy (v3) = (wr (E1) + Wy (E2) + Wy (E3)) + Wy (T) =0. 
Since the solid angles of the edges are all 5 and wy (T) = 1, we recover 
our friendly high-school identity “the sum of the angles in a triangle is 180 
degrees”: 
1 
wr (v1) + wr (v2) + w7 (v3) = 5° 
Thus the Brianchon—Gram relation is the extension of this well-known fact to 


any dimension and any convex polytope. 

















Proof of Theorem 11.8. It suffices to prove Aa(0) = 0 for a rational simplex 
A, since solid angles of a triangulation simply add, by (11.1). Theorem 11.7 
gives A, (0) = 0 if dim A is odd. 

So now suppose A is a rational d-simplex, where d is even, with vertices 
Vi, V2,---,Va+1- Let P(n) be the (d+1)-dimensional pyramid that we obtain 
by taking the convex hull of (v1, 0) ,(v2,0),..., (wa41,0), and (0,0,...,0,7), 
where n is a positive integer (see Figure 11.1). Note that, since d+ 1 is odd, 


n 





Tı 


Fig. 11.1. The pyramid P(n) for a triangle A. 


AP O= DO D opn (F(n)) = 0. 
F(n)CP(n) 


We will conclude from this identity that Y pca (1) Fwa (F) = 0, which 
implies that Aa(0) = 0. To this end, we consider two types of faces of P(n): 


(a) those that are also faces of A, and 
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(b) those that are not contained in A. 


We start with the latter: Aside from the vertex (0,0,...,0,), every face F(n) 
of P(n) that is not a face of A is the pyramid over a face G of A; let’s denote 
this pyramid by Pyr (G,n). Further, as n grows, the solid angle of Pyr (G,n) 
(in P(n)) approaches the solid angle of G (in A): 


jim, WP(n) (Pyr (G, n)) =WA (G) ’ 


since we’re forming A x [0, co) in the limit. On the other hand, a face F(n) = G 
of P(n) that is also a face of A obeys the following limit behavior: 


1 
dim wp(n) (F(n)) = 5 wa (9). 


The only face of P(n) that we still have to account for is the vertex v := 
(0,0,...,0,n). Hence 


0 = (A F pn (F(n)) 
F(n)EP(n) 


= wpn) (v) + X (-1)9" wn) (Pyr (G, n)) 
GTA 


HI OD apay (G)- 


GCA 


Now we take the limit as n — oo on both sides; note that limn—o wp(n)(V) = 
0, so that we obtain 


0= 5 (yet ax (G) 5 ( jaag ! WA (G) 

















ae GCA 
as De Hiag a (G), 
2 STA 
and so 
Aa (0) = X (-1)9™Swa(G) = 0 
GCA 


The combination of Theorems 11.7 and 11.8 implies that summing solid 
angles in a polygon is equivalent to computing its area: 


Corollary 11.11. Suppose P is a 2-dimensional integral polytope with area 
A. Then Ap(t) = At?. 
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11.4 The Generating Function of Macdonald’s 
Solid-Angle Polynomials 


We conclude this chapter with the study of the solid-angle analogue of Ehrhart 
series. Given an integral polytope P, we define the solid-angle series of P as 
the generating function of the solid-angle polynomial, encoding the solid-angle 
sum over all dilates of P simultaneously: 


Solidp(z) := $- Ap(t) 2". 
t>0 


The following theorem is the solid-angle analogue to Theorems 3.12 and 4.4, 
with the added bonus that we get the palindromy of the numerator of Solidp 
for free. 


Theorem 11.12. Suppose P is an integral d-polytope. Then Solidp is a ra- 
tional function of the form 


d d—1 a 
Solidp(z) = Qaz” + a a + az 
nE 





Furthermore, we have the identity 
1 
Solidp C) = (-1)"*' Solidp(z) 
z 


or, equivalently, ak = ad+1-k for 1 < k < g. 


Proof. The form of the rational function Solidp follows by Lemma 3.9 from 
the fact that Ap is a polynomial. The palindromy of a1, a2, . . . , aq is equivalent 
to the relation 


Solidp (=) = (—1)4*! Solidp(z), 
which, in turn, follows from Theorem 11.7: 
Solidp(z) = X Ap(t) 2! = $°(-1)*Ap(-t) 2! = (-1)4 > Ap(t) 2. 
t>0 t>0 t<0 
Now we use Exercise 4.5: 


(-1)4 XC Ap(t) 2™ = (-1)*" X Ap(t) 2 = (-1)**" Solidp (+) 


t<0 t>1 





In the last step we used the fact that Ap (0) = 0 (Theorem 11.8). 
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Notes 


1. I. G. Macdonald inaugurated the systematic study of solid-angle sums in 
integral polytopes. The fundamental Theorems 11.4, 11.7, and 11.8 can be 
found in his 1971 paper [123]. The proof of Theorem 11.7 we give here follows 
[26]. 


2. The Brianchon—Gram relation (Corollary 11.9) is the solid-angle analogue 
of the Euler relation for face numbers (Theorem 5.2). The 2-dimensional case 
discussed in Example 11.10 is ancient; it was most certainly known to Euclid. 
The 3-dimensional case of Corollary 11.9 was discovered by Charles Julien 
Brianchon in 1837 and—as far as we know—was independently reproved by 
Jorgen Gram in 1874 [88]. It is not clear who first proved the general d- 
dimensional case of Corollary 11.9. The oldest proofs we could find were from 
the 1960s, by Branko Griinbaum [90], Micha A. Perles, and Geoffrey C. Shep- 
hard [142, 162]. 


3. Theorem 11.5 is a particular case of a reciprocity relation for simple lattice- 
invariant valuations due to Peter McMullen [128], who also proved a parallel 
extension of Ehrhart—Macdonald reciprocity to general lattice-invariant val- 
uations. There is a current resurgence of activity on solid angles; see, for 
example, [52]. 


Exercises 


11.1. Compute Ap(t), where P is the regular tetrahedron with vertices 
(0,0,0), (1,1,0), (1,0,1), and (0,1,1) (see Exercise 2.13). 


11.2. Compute Ap(t), where P is the rational triangle with vertices (0,0), 
(5,5), and (1,0). 


11.3. & Let K be a rational d-cone and let m € Zt. By analogy with Exer- 
cise 3.5, show that am+x(z) = z™ax (z). 


11.4. & Complete the proof of Theorem 11.5: For a rational pointed d-cone 
K, ax(z) is a rational function that satisfies 


ak (z) = (-1)4ax (z). 


11.5. & Suppose A is a rational simplex. Prove that 


X wa(F) X o okelaz) = 5 XO we (F) ore (2). 


FCA v a vertex of F v a vertex of A FCKy 
dim F>0 dim F>0 
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11.6. & Provide the details of the proof of Theorem 11.7 for rational poly- 
topes: Prove that if P is a rational convex polytope, then the quasipolynomial 


Ap satisfies 
Ap(-t) = (-1)™P Ap(t). 


11.7. Recall from Exercise 3.1 that to any permutation 7 € Sq on d elements 
we can associate the simplex 


Ax := conv {0, €701); €701) + €r(2) «+ +s €n(1) + en(2) +++ + en(a) }- 
Prove that for all r € Sn, Aa, (t) = 7 tê. 


11.8. Give a direct proof of Corollary 11.11, e.g., using Pick’s theorem (The- 
orem 2.8). 


11.9. State and prove the analogue of Theorem 11.12 for rational polytopes. 


Open Problems 


11.10. Study the roots of solid-angle polynomials. 


11.11. Classify all polytopes that have only rational solid angles at their ver- 
tices. 


11.12. Among all d-simplices A, does the regular d-simplex have the property 
that oy a vertex of AWA(V) is a minimum? 


11.13. Which integral polytopes P have solid-angle polynomials Ap(t) € 
Q[t]? That is, for which integral polytopes P are all the coefficients of Ap(t) 
rational? 


11.14. Are there solid-angle polynomials with negative coefficients? 


11.15. Does the numerator polynomial of the rational generating function 
Solidp(z) always have nonnegative coefficients, by analogy with Theorem 3.12? 


12 





A Discrete Version of Green’s Theorem 
Using Elliptic Functions 


The shortest route between two truths in the real domain passes through the complex 
domain. 


Jacques Salomon Hadamard (1865-1963) 


We now allow ourselves the luxury of using basic complex analysis. In par- 
ticular, we assume that the reader is familiar with contour integration and 
the residue theorem. We may view the residue theorem as yet another re- 
sult that intimately connects the continuous and the discrete: it transforms a 
continuous integral into a discrete sum of residues. 

Using the Weierstra8 p and ¢ functions, we show here that Pick’s theorem 
is a discrete version of Green’s theorem in the plane. As a bonus, we also 
obtain an integral formula (Theorem 12.5 below) for the discrepancy between 
the area enclosed by a general curve C and the number of integer points 
contained in C. 


12.1 The Residue Theorem 


We begin this chapter by reviewing a few concepts from complex analysis. 
Suppose the complex-valued function f has an isolated singularity w € G; 
that is, there is an open set G C C such that f is analytic on G \ {w}. Then 
f can be expressed locally by the Laurent series 


f(z) =o en(z-w)”, 
nEZ 
valid for all z € G; here cn € C. The coefficient c_, is called the residue of f 
at w; we will denote it by Res(z = w). The reason to give c—ı a special name 
can be found in the following theorem. We call a function meromorphic if 
it is analytic in C with the exception of isolated poles. 
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Theorem 12.1 (Residue theorem). Suppose f is meromorphic and C is 
a positively oriented, piecewise differentiable, simple, closed curve that does 
not pass through any pole of f. Then 


17 = a De Bee 


where the sum is taken over all singularities w inside C. 














If f is a rational function, Theorem 12.1 gives the same result as the 
partial fraction expansion of f. We illustrate this philosophy by returning to 
the elementary beginnings of Chapter 1. 


Example 12.2. Recall our constant-term identity for the restricted partition 
function for A = {a1,a2,...,aqa} in Chapter 1: 





1 
pa(n) = const (Garces) . 


Computing the constant term of the Laurent series of T 





1 
1—221)-.-(1—z%d)z” ex 
panded about z = 0 is, naturally, equivalent to “shifting” this function by one 


exponent and computing the residue at z = 0 of the function 


1 
f(z) := (1 — 291) (1 — 292)... (1 — z@a) grt” 





Now let C be a positively oriented circle of radius r > 1, centered at the 
origin. The residue Res(z = 0) = p4 (n) is one of the residues that are picked 
up by the integral 


Hae f = Res(z a 


where the sum is over all honzeto poles w of f that lie inside C,. These 


poles are at the at}, atP,...,atP roots of unity. Moreover, with the help of 


Exercise 12.1 we can show that 


roo 2ri Jo, 
= lim (Rows =0)+ N> Res(z = ») 
= Res(z = 0) + > Res(z = w), 


where the sum extends over all at}, at?,..., at? roots of unity. In other words, 


pa(n) = Res(z = 0) sae, 
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To obtain the restricted partition function p4, it remains to compute the 
residues at the roots of unity, and we invite the reader to realize that this 
computation is equivalent to the partial fraction expansion of Chapter 1 (Ex- 
ercise 12.2). 














Analogous residue computations could replace any of the constant-term 
calculations that we performed in the earlier chapters. 


12.2 The WeierstraB g and ¢ Functions 


The main character in our play is the Weierstraß ¢-function, defined by 


1 1 5 
i D (= 'm+ni ` 5) (12.1) 


(m,n)EZ?\ (0,0) 





Rie 


(z) = 


This infinite sum converges absolutely for z belonging to compact subsets of 
the lattice-punctured plane C \ Z? (Exercise 12.4), and hence forms a mero- 
morphic function of z. 

The Weierstra8 ¢-function possesses the following salient properties, which 
follow immediately from (12.1): 


(1) ¢ has a simple pole at every integer point m+ni and is analytic elsewhere. 
(2) The residue of ¢ at each integer point m + ni equals 1. 


We can easily check (Exercise 12.5) that 


p@=-M=5+ E (— z 2); 


(mn)eZ\ (0,0) m + ni)) (m+ nt) 

(12.2) 
the Weierstraß ¢-function. The g-function has a pole of order 2 at each 
integer point m+ ni and is analytic elsewhere, but has residue equal to zero at 
each integer point m-+ nz. However, p possesses a very pleasant property that 
¢ does not: p is doubly periodic on C. We may state this more concretely: 





Lemma 12.3. p(z + 1) = p(z + i) = p(z). 


Proof. We first invite the reader to prove the following two properties of g’ 
(Exercises 12.6 and 12.7): 


g (z+1)= p(z), (12.3) 


ri g (z)dz is path independent. (12.4) 


Zo 


By (12.3), 
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so p(z +1) — p(z) = c for some constant c. On the other hand, p is an even 
function (Exercise 12.8), and so z = —} gives us 


c= 9(3) —p(-4) = 9. 


This shows that p (z + 1) = p(z) for all z € C\Z?. An analogous proof, which 
we invite the reader to construct in Exercise 12.9, shows that p (z + i) = p(z). 














Lemma 12.3 implies that p(z + m+ ni) = p(z) for all m,n € Z. The 
following lemma shows that the Weierstra% ¢-function is only a conjugate- 
analytic term away from being doubly periodic. 


Lemma 12.4. There is a constant a such that the function ¢(z)+azZ is doubly 
periodic with periods 1 and i. 


Proof. We begin with w = m + nt: 


m+ni 
G(e+mtni)— C2) =- | p(z +w)dw, (12.5) 
w=0 
by definition of p(z) = —¢’(z). To make sure that (12.5) makes sense, we 


should also check that the definite integral in (12.5) is path independent (Ex- 
ercise 12.10). 
Due to the double periodicity of p, 


m+ni 1 1 
if ole +w)dw =m | p(ettjattni | p(z +it)dt 
w=0 0 0 


= ma(z) + niG(z), 


where 


a(z) =j o(z+t) dt and B(z) =f (z+ it) dt. 


Now we observe that a(z + zo) = a(z) for any zo E R, so that a (a+ iy) 
depends only on y. Similarly, 6 (x + iy) depends only on x. But 


C (z +m + in) — ¢(z) = — (may) + inG(z)) 


must be analytic for all z € C \ Z?. If we now set m = 0, we conclude that 
G(x) must be analytic in C \ Z?, so that B(x) must be a constant by the 
Cauchy—Riemann equations for analytic functions. Similarly, setting n = 0 
implies that a(y) is constant. Thus 


Ç (z +m + in) — ¢(z) = — (ma + inp) 


with constants a and 8. Going back to the Weierstra8 g-function, we can 
integrate the identity (Exercise 12.11) 
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pliz) = —9(z) (12.6) 
to obtain the relationship 6 = —a, since 
1 1 1 
B =| ole +itjat= | p (it) dt = -f p (t)dt =-a. 
0 0 0 
To summarize, we have 


C(z+m+in) — C(z) = -ma + ina = —a(z+m+in—Z), 











so that ¢(z) + aZ is doubly periodic. 





12.3 A Contour-Integral Extension of Pick’s Theorem 
For the remainder of this chapter, let C be any piecewise-differentiable, simple, 


closed curve in the plane, with a counterclockwise parametrization. We let D 
denote the region that C contains in its interior. 


Theorem 12.5. Let C avoid any integer point, that is, C N Z? = Ø. Let I 
denote the number of integer points interior to C, and A the area of the region 
D enclosed by the curve C. Then 


1 
F) (¢(z) —mzZ)dz =I- A. 
2ri Jo 
Proof. We have 
i (¢(z) + aZ) dz = f C(z)dz + af (x — iy) (dx + idy). 
C C Cc 
By Theorem 12.1, fe Ç(z) dz is equal to the sum of the residues of ¢ at all of 


its interior poles. There are J such poles, and each pole of ¢ has residue 1. 
Thus 


201 


1 
— f, C(z)dz=T. (12.7) 


On the other hand, Green’s theorem tells us that 
| (a — ty) (dx + idy) =} (a — ty) dx + (y + ix) dy 
Cc 


C 
o . o : 
= | g 0t- 5 E) 


= ff z 


=2iA. 


196 12 A Discrete Version of Green’s Theorem Using Elliptic Functions 


Going back to (12.7), we get 
1 (C(z) + aZ) dz = 2ril + a (2i A). (12.8) 
c 


We only have to show that œ = —r. Consider the particular curve C that is 
a square path, centered at the origin, traversing the origin counterclockwise, 
and bounding a square of area 1. Thus J = 1 for this path. Since ¢(z) + az 
is doubly periodic by Lemma 12.4, the integral in (12.8) vanishes. We can 
conclude that 

0 = 27i-14+a(2%-1), 











so that a = —7. 





Notice that Theorem 12.5 has given us information about the Weierstraf 
¢-function, namely that a = —7. 

This chapter offers a detour into an infinite landscape of discrete results 
that meet their continuous counterparts. Equipped with the modest tools 
offered in this book, we hope we have motivated the reader to explore this 
landscape further... 


Notes 


1. The Weierstra8 o-function, named after Karl Theodor Wilhelm Weier- 
stra8 (1815-1897),1 can be extended to any two-dimensional lattice £ = 
{kwi + jwo: k,j E€ Z} for some wi,w2 E€ C that are linearly independent 


over R: 


me L\{0} 


The Weierstraf g¢-function and its derivative g satisfy a polynomial rela- 
tionship, namely, (o) = 4 (p£)? — g2 p£ — g3 for some constants gz and g3 
that depend on £. This is the beginning of a wonderful friendship between 
complex analysis and elliptic curves. 


2. Theorem 12.5 appeared in [75]. There it is also shown that one can retrieve 
Pick’s theorem (Theorem 2.8) from Theorem 12.5. 
Exercises 


12.1. & Show that for positive integers a1, đa, ..., ad, ^n, 


1 For more information about Weierstraß, see 
http: //www-groups.dcs.st-and.ac.uk/~history/Biographies/Weierstrass.html. 
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1 


li =0. 
Ree Cp (1 — za) oe (1 = zea) grt] 





This computation shows that the integrand above “has no pole at infinity.” 


12.2. & Compute the residues at the nontrivial roots of unity of 


1 
f(z) = (1 — 2%)... (1 — 24a) zn+1 d 





For simplicity, you may assume that a1,a2,...,@q are pairwise relatively 
prime. 


12.3. Give an integral version of Theorem 2.13. 
12.4. & Show that 





1 4 y ( 1 i 1 i zZ ) 
cole E A T è T Ro) 
%  (m,n)ez2\(0,0) \* (m+ni) m+ni (m+ni) 
converges absolutely for z belonging to compact subsets of C \ Z?. 


12.5. d Prove (12.2), that is, 
1 1 i 
Cazman ( A2 a) 
(m,n)EZ?\ (0,0) (z — (m + ni)) (m+ ni) 
12.6. & Prove (12.3), that is, show that g (z + 1) = g' (2). 


12.7. & Prove (12.4), that is, show that for any zo, 21 € C \2?, ie p (w) dw 
is path independent. 


12.8. & Show that p is even, that is, p (—z) = p(z). 
12.9. & Finish the proof of Lemma 12.3 by showing that p (z + i) = p(z). 
12.10. & Prove that the integral in (12.5), 








w=m+ni 
ce+m+ni)- ie) =- f p(z+w)dw, 


w=0 


is path independent. 
12.11. & Prove (12.6), that is, p(iz) = —e(z). 


Open Problems 


12.12. Can we get even more information about the Weierstraß p and Ç func- 
tions by using more detailed knowledge of the discrepancy between J and A 
for special curves C? 


12.13. Find a complex-analytic extension of Theorem 12.5 to higher dimen- 
sions. 


A 





Vertex and Hyperplane Descriptions 
of Polytopes 


Everything should be made as simple as possible, but not simpler. 


Albert Einstein 


In this appendix, we prove that every polytope has a vertex and a hyperplane 
description. This appendix owes everything to Günter Ziegler’s beautiful ex- 
position in [193]; in fact, these pages contain merely a few cherries picked from 
[193, Lecture 1]. 

As in Chapter 3, it is easier to move to the world of cones. To be as concrete 
as possible, let us call X£ C R? an h-cone if 


K={xeR’: Ax <0} 


for some A € R™*4; in this case K is given as the intersection of m halfspaces 
determined by the rows of A. We use the notation K = hcone(A). 
On the other hand, we call K C Rt a v-cone if 


K={By: y >20} 


for some B € RÎ*”, that is, K is a pointed cone with the column vectors of 
B as generators. In this case we use the notation K = vcone(B). 

Note that, according to our definitions, any h- or v-cone contains the origin 
in its apex. We will prove that every h-cone is a v-cone and vice versa. More 
precisely: 


Theorem A.1. For every A € R™*4 there exists B € R?*” (for some n) 
such that hcone(A) = vcone(B). Conversely, for every B € R?*" there exists 
A E€ R™*4 (for some m) such that vcone(B) = hcone(A). 


We will prove the two halves of Theorem A.1 in Sections A.1 and A.2. For 
now, let us record that Theorem A.1 implies our goal, that is, the equivalence 
of the vertex and halfspace description of a polytope: 
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Corollary A.2. If P is the convex hull of finitely many points in RÌ, then P 
is the intersection of finitely many half spaces in RÌ. Conversely, if P is given 
as the bounded intersection of finitely many half spaces in R?, then P is the 
conver hull of finitely many points in R. 


Proof. If P = conv {v1, V2,..., Vn} for some v1, V2,..., Vn E Rf, then coning 
over P (as defined in Chapter 3) gives 


= Vi V2 ... Vn 
cone(P) = vcone ( 11 1 ) ; 


By Theorem A.1 we can find a matrix (A, b) € R™*‘4) such that 
cone(P) = hcone(A, b) = {x € R™ : (A, b)x < 0}. 
We recover the polytope P upon setting £a+ı = 1, that is, 
P={xeR?: Ax < -b}, 


which is a hyperplane description of P. 
These steps can be reversed: Suppose the polytope P is given as 


P={xeR’: Ax < -b} 
for some A € R™*4 and b € R™. Then P can be obtained from 
hcone(A, b) = {x € R“*!: (A,b)x < 0} 


by setting ra+ı = 1. By Theorem A.1 we can construct a matrix B € R(4+))*" 
such that 


hcone(A, b) = vcone(B) . 


We may normalize the generators of vcone(B), that is, the columns of B, such 
that they all have (d + 1)st variable equal to one: 


2 VI V2... Vn 
a a 


Since P can be recovered from vcone(B) by setting za+}ı = 1, we conclude 
that P = conv {vi, V2,..., Vn}. 














A.1 Every h-cone is a v-cone 


Suppose 
K = hcone(A) = {x € R’: Ax < 0} 


for some A € R™*4, We introduce an auxiliary m-dimensional variable y and 
write 
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k= { (2) erm: axsyln{(*) erm: y=}. (A.1) 


(Strictly speaking, this is K lifted into a d-dimensional subspace of R¢+™.) 
Our goal in this section is to prove the following two lemmas. 


Lemma A.3. The h-cone {() ERT”: Ax< y} is a v-cone. 


Lemma A.4. If K C R? is a v-cone, then so is KN {x ERI: zp = 0}, for 
any k. 


The first half of Theorem A.1 follows with these two lemmas, as we can start 
with (A.1) and intersect with one hyperplane yp = 0 at a time. 


Proof of Lemma A.3. We start by noting that 


c= {G)een ans} 
{E anG) 


is an h-cone; here I represents an m x m identity matrix. Let us denote the 
k* unit vector by ex. Then we can decompose 


-ele mem (2) 


j=l 
d 


-Sisme (£ J+- a00 (2). 

















Note that if G) E€ K then y, — (Ax), > 0 for all k, and so (5) can be written 
as a nonnegative linear combination of the vectors sign (xj) (x, J TSJ <d, 
and (aos 1< k< m. But this means that K is a v-cone. 














Proof of Lemma A.4. Suppose K = vcone(B), where B has the column vec- 
tors bı, b2,...,bn € R; that is, b1, b2,..., bn are the generators of K. Fix 
k < d and construct a new matrix Bẹ whose column vectors are all b; for 
which bj = 0, and the combinations bib; — b;,b; whenever bj, > 0 and 
bjk < 0. We claim that 


Kn {xeR?: £p = 0} = veone (Bg). 


Every x € vcone(B,) satisfies x, = 0 by construction of Bẹ, and so 
vcone (B) C KN {x ERT: £p = 0} follows immediately. We need to do some 
more work to prove the reverse containment. 
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Suppose x € Kn{x ERT: zk = 0}, that is, x = 1 b1 + A2b2 +- -+ An bn 
for some A1, À2,..., Àn > 0 and £k = A1b1k + A2b2k ++- + Anbnk = 0. This 


allows us to define 
XO Aibix =— Y Abjir- 
i: bin >O j: bjk<0 


Note that A > 0. Now consider the decomposition 
X o Abj+ X àbit SS Abs. (A.2) 
j: bjk=0 i: bik>0 j: bjk <0 


If A = 0 then \;b;, = 0 for all i such that big > 0, and so A; = 0 for these i. 
Similarly, A; = 0 for all j such that bj, < 0. Thus we conclude from A = 0 
that 
5 Ajb; € vcone (Bx). 
j: bj~=0 


Now assume A > 0. Then we can expand the decomposition (A.2) into 


= E b+ — So Abi i wn 





wie bjk= 0 kE bjk <0 i: bik>0 
ulg (ze 
re >O j: bjr <0 
= So àbj+ 1 SO Aidy (bib — bjrb;), 
j:bjk=0 i: bip >0 
j:bjk<0 





which is by construction in vcone (Bx). 











A.2 Every v-cone is an h-cone 


Suppose 
K = vcone(B) = {By: y > 0} 
for some B € R?*”. Then K is the projection of 


{(*) eR. y20.x=By} (A.3) 


to the subspace {() ERM": y= o}. The constraints for (A.3) can be writ- 
ten as 

x 

y>0 and a -B) (X) =0. 

y 
Thus the set (A.3) is an h-cone, for which we can project one component of 
y at a time to obtain K. This means that it suffices to prove the following 
lemma to finish the second half of Theorem A.1. 
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Lemma A.5. If K is an h-cone, then the projection {x — £kep : x E€ K} is 
also an h-cone, for any k. 


Proof. Suppose K = hcone(A) for some A € R™*4, Fix k and consider 
Pr={xtre,: XEK, ACR}. 
The projection we’re after can be constructed from this set as 
{x — xpex : xEK}=P,N{xER*: zk =0}, 


so that it suffices to prove that Pk is an h-cone. 

Suppose aj, a2,...,am are the row vectors of A. We construct a new ma- 
trix A; whose row vectors are all a; for which a;, = 0, and the combinations 
Qikaj — ajka; Whenever aik > 0 and ajk < 0. We claim that Py = hcone (Ax). 

If x € K then Ax < 0, which implies A; x < 0 because each row of Ax is 
a nonnegative linear combination of rows of A; that is,  C hcone (Ap). How- 
ever, the kt! component of Ax is zero by construction, and so K C hcone (Ax) 
implies P;, C hcone (Ax). 

Conversely, suppose x € hcone (Ax). We need to find a A € R such that 
A (x — Aex) < 0, that is, 


aiti ++++ + aik (Ek — A) +--+ + a1ata <0 





amiz +: + amk (£k — A) + +++ + Gmata <0. 


The jt? constraint is aj: X—aj,A < 0, that is, aj -x < aj,A. This gives the 
following conditions on A: 








ai X ) 

A> 4 if aik > 0, 
Qik 
a; x : 

vA< 4 if ajk < 0. 
Qjk 


Such a A exists because if aj, > 0 and aj, < 0 then (since x € hcone (A;)) 
(aikaj — ajkai) ‘xX < 0, 


which is equivalent to 
aj: X 2 aj’ X 











Qik Qjk 
Thus we can find a A that satisfies 
aj: X a; x 
T LAs L, 
Qik Qjk 














which proves hcone (Ax) C Pp. 


B 





Triangulations of Polytopes 


Obvious is the most dangerous word in mathematics. 


Eric Temple Bell 


The goal of this appendix is to prove Theorem 3.1. Recall that a triangula- 
tion of a convex d-polytope P is a finite collection T of d-simplices with the 
following properties: 


e P=UA. 


AET 
e For any Aj, Ag E T, A1 N Ag is a face common to both A; and Ag. 
Theorem 3.1 says that P can be triangulated using no new vertices, that is, 
there exists a triangulation T such that the vertices of any A € T are vertices 
of P. In preparation, we first show that a triangulation of a polytope induces 
a triangulation on any of its facets in a natural way. 


Lemma B.1. Suppose T(P) is a triangulation of the d-polytope P, and F is 
a facet of P. Then 


T(F) :={SAF:S€T(P), dim(SNF)=d—-1} 
is a triangulation of F. 


Proof. To avoid unnecessary notation, we write UT(F) for Userr) A. We 
have to show: 


(i) F=UT(F). 

(ii) For any Ay, A2 E€ T(F), A1 N Ag is a face common to both A; and Ag. 
(i) First, UT(F) C F by definition of T(F). Now we will show that F \ 
UT(F) = ©, by means of contradiction. Let x € F \UT(F). If there is a 
neighborhood N of x in F that contains no points of UJ T(F), then N consists 
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only of points contained in some simplices from T(P) that meet F in a set of 
dimension less than d — 1, which is impossible since dim N = d — 1 and there 
are only finitely many simplices in T(P). Hence any neighborhood of x in F 
contains points in some A € T(F). However, JT(F) is closed, and so such 
an x cannot exist. Hence F \ UT(F) = Ø. 


(ii) Given Aj, A2 € T(F), there are S1, S2 E€ T(P) such that 
A, =Si NF and Ag = SoNF 


and the intersections of both Sı and S2 with F are (d — 1)-dimensional. Now 
Ai N A2 = S1 N S2 N F, and since S1, S2 E T(P), S1 N S2 is a face of both Sı 
and S2. That is, there are hyperplanes Hı and H3 in R? such that 


S1 N S2 = & N Hı and S1 N S2 = & N He. 
The (d — 1)-hyperplanes H1, Hə in Rt induce the (d — 2)-hyperplanes 
hı := Hı N span F and hə := Hə N span F 


in span F. We claim that hı N A, = A1 N Ag = h2 N Ag, that is, A1 O Ag isa 
face of both A, and Ag. Indeed, 
hyN Ay = hin (Si NF) 
= (Hı Aspan F)N (SiN F) 
= (Hi N S1) N (F N span F) 
= (81 N82) NF 
= 4AN A2, 











and a practically identical calculation gives hy N Ag = A1 N Ao. 





Proof of Theorem 3.1. We use induction on the number of vertices of the d- 
polytope P. If P has d + 1 vertices, then P is a simplex, and {P} is a trian- 
gulation. 

For the induction step, suppose we are given a d-polytope P with at least 
d+2 vertices. Fix a vertex v of P such that Q, the convex hull of the remaining 
vertices of P, is still of dimension d. By the induction hypothesis, we can 
triangulate Q. 

We call a facet F of Q visible from v if for any x € F, the half-open line 
segment (x, v] is disjoint from Q. By the Lemma, the triangulation T(Q) of 
Q induces the triangulation 


T(F) ={ANF: Ae T(Q), dim(ANF) =d-1} 


of a facet F of Q. 

Let T consist of the convex hulls of v with each (d — 1)-simplex in the tri- 
angulations of the visible facets. We claim that TUT(Q) forms a triangulation 
of P. To prove this, we have to show: 
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G) P=U(TUT(Q)) . 
(ii) For any A), Ag E€ TUT(Q), Ai N Ag is a face common to both A, 
and Ao. 


(i) P D U(TUT(Q)) follows by the definitions of T and T(Q). To prove 
P CU(TUT(Q)), assume x € P is given. If x € Q then x € UT(Q). If 
x € P\ Q, consider the line through v and x. This line meets Q (because P is 
convex), so let y € Q be the first point in Q that we meet when traveling along 
the line towards Q. This point y is on a facet of Q that is, by construction, 
visible from v, and hence x € A for some A E T. 


ii) Given A), A2 € TUT(Q), there are three cases: 


(ii) 

(a) Ai, A2 E€ T(Q); 

(b) Ay, Ag E€ T: 

(c) Ay E T, As E T(Q). 


In each case we have to show that A; N Ag is a face common to A, and Ag. 
(a) Since T(Q) is a triangulation, A, N Ag is a face of both A; and Ag. 


(b) Given A1, A2 € T, there exist S1, S2 E€ T(F) such that A; = conv {v, S1} 
and A2 = conv {v, S2}. Since T(F) is a triangulation, $1.52 is a face common 
to Sı and S2. By convexity, Ay N A, = conv {v, S1 N S2}. Exercise 2.6 shows 
that Sı N S2 is a simplex, and that this simplex is the convex hull of some 
of the common vertices of Sı and S2. But then A; N Ag = conv {v, S1 N S2} 
is the convex hull of some of the common vertices of A, and Ag and hence, 
again by Exercise 2.6, a face of both A; and Ag. 


(c) Since A; E T, there exists S € T(F) such that A; = conv{v,S}. By 
construction, A; N Q = S, and S is a face of some A € T(Q). Since T(Q) is 
a triangulation, AM Ag is a face common to A and Ag. But then 


A1 N A2 = SN A2 = (SNA) Nag = SN (AN A3) 


is an intersection of two faces of A and hence by Exercise 2.6 again a face of 
A and a simplex. The vertices of Ay N Ag = SN (AN A2) form a subset of 
the vertices common to S and Ag. Since S is a face of A1, A, N Ag is a face 
of both A; and Ag, by Exercise 2.6. 

















Hints for & Exercises 


Well here’s another clue for you all. 


John Lennon & Paul McCartney (“Glass Onion,” The White Album) 


Chapter 1 


1.1 Set up the partial fraction expansion as 


z A B 


I-z-2 j=; pae; 








and clear denominators to compute A and B; one can do so, for example, by 
specializing z. 





1.2 Multiply out (1 — z) (1 +z +2? +--+ 2”). For the infinite sum, note 
that limk—oo 2" = 0 if |z| < 1. 


1.3 Start with the observation that there are |x| + 1 lattice points in the 
interval [0, x]. 


1.4 (i) & (j) Write n = gm +r for some integers q,r such that 0 < r < m. 
Distinguish the cases r = 0 and r > 0. 


1.9 Use the fact that if m and n are relatively prime, given any a € Z there 
exists b € Z (which is unique modulo n) such that mb = a (mod n). For the 
second equality of sets, think about the case a = 0. 


1.12 First translate the line segment to the origin and explain why this trans- 
lation leaves the integer-point enumeration invariant. For the case (a,b) = 
(0,0), first study the problem under the restriction that gcd (c, d) = 1. 


1.17 Given a triangle 7 with vertices on the integer lattice, consider the 
parallelogram P formed by two fixed edges of T. Use integral translates of P 
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to tile the plane R?. Conclude from this tiling that P contains only its vertices 
as lattice points if and only if the area of P is 1. 


1.20 Given an integer b, the Euclidean algorithm asserts the existence 
of m1, M2,..., Mq E Z such that b can be represented as b = miaı + 
M202 +---+mMaag. Convince yourself that we can demand that in this rep- 
resentation 0 < m2,m3,...,Mq < a@,. Conclude that all integers beyond 
(ay — 1) (a2 + a3 +--+ + aq) are representable in terms of a1, a2,...,aa. (This 
argument can be refined to yield another proof of Theorem 1.2.) 


1.21 Use the setup 
b-1 


_ Ay Ag _ An 2 i Dj 
{Qe tte tet (ap +h oe ye . 


2 n = gE 
á 5 á 12T & 











To compute Ck, multiply both sides by (z — EE) and calculate the limit as 
z — €*. The coefficients D; can be computed in a similar fashion. 


1.22 Use Exercise 1.9 (with m = b~') on the left-hand side of the equation. 


1.24 Suppose a > b. The integer a+b certainly has a representation in terms 
of a and b, namely, 1 -a + 1-b. Think about how the coefficient of b would 
change if we changed the coefficient of a. 


1.29 Use the partial fraction setup (1.11), multiply both sides by (z — €*.), 
and take the limit as z — a 


1.31 Convince yourself of the generating-function setup 


: hn za a2 zad 
X pa(n) z =( 5) (==) (=). 
n>1 


Now use the machinery of Section 1.5. 








Chapter 2 


2.1 Use Exercise 1.3 for the closed interval. For open intervals, you can use 
Exercise 1.4(j) or the [...] notation of Exercise 1.4(e). To show the quasi- 
polynomial character, rewrite the greatest-integer function in terms of the 
fractional-part function. 


2.2 Write R as a direct product of two intervals and use Exercise 1.3. 


2.6 Start by showing that the convex hull of a d-element subset W of V is 
a face of A. This allows you to prove the first statement by induction (using 
Exercise 2.5). For the converse statement, given a supporting hyperplane H 
that defines the face F of A, let W C V consist of those vertices of A that 
are in H. Now prove that any point 
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x= Avi + A2Ve ae eae Àd+1Vd+1 


in F has to satisfy Ax = 0 for all v ¢ W. 


2.7 First show that the linear inequalities and equations describing a rational 
polytope can be chosen with rational coefficients, and then clear denominators. 


; 1 = k k k 
2.9 Write Uza = (es Zz :) ss Z a Suy (are Z a and 
come up with a combinatorial enumeration scheme to compute the coefficients 


of this power series. 


2.10 Write Cy = GH F RU tbh d+) and switch t to —t. 





2.14 Think about the poles of the function -y and use a theorem from 
complex analysis. 


ze 
l—e-?* 





2.15 Compute the generating function of Bg(1 — x) and rewrite it as 


2.16 Show that =4, + iz is an even function of Z. 





2.23 Follow the steps of the proof of Theorem 2.4. 


2.24 Extend T to a rectangle whose diagonal is the hypotenuse of 7, and 
consider the lattice points on this diagonal separately. 


2.25 For the area use elementary calculus. For the number of boundary 
points on tP, extend Exercise 1.12 to a set of line segments whose union 
forms a simple closed curve. 





2.31 Rewrite the inequality as (|| 1) e+ ([2] 1) f < tr and compare 
this with the definition of T. 


2.32 To compute C3, multiply both sides of (2.20) by (z — 3)? and compute 
the limit as z — 1. The coefficients A; and B; can be computed in a similar 
fashion. To compute C2, first move aos in (2.20) to the left-hand side, 
then multiply by (z — 1)? and take the limit as z — 1. A similar, even more 
elaborate, computation gives C1. (Alternatively, compute the Laurent series 
of the function in (2.20) at z = 1 with a computer algebra system such as 
Maple or Mathematica.) 


2.34 Follow the proof of Theorem 2.10. Use Exercise 2.33 to compute the ad- 
ditional coefficients in the partial fraction expansion of the generating function 
corresponding to this lattice-point count. 


2.36 Start with computing the constant term of 


1 
(1 — 2122) (1 — 2222) (1 — 21) (1 — 22) 23t 22t 





with respect to zg by treating z1 as a constant and setting up a partial fraction 
expansion of this function with respect to z2. 
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Chapter 3 


3.2 Write the simplicial cones as cones over simplices and use Exercise 2.6. 


3.4 Write down a typical term of the product 


Og (21, 225-+-52m) OT miir Zm42;+-+)2mtn)- 


3.5 Multiply out z™ox(z). 


3.6 Write a typical term in og (2, 4,...,4) =05 (Areas aiaa ) : 


3.8 Given the polynomial f, split up the generating function on the left-hand 
side according to the terms of f and use (2.2). Conversely, if the polynomial 
g is given, use (2.6). 


3.13 Show that H N Zf is a Z-module. Therefore it has a basis; extend this 
basis to a basis of Z. 


3.14 Start by proving the result for a single hyperplane, for example, by 
referring to Exercise 3.13. 


3.19 Given f, split up the generating function on the left-hand side according 
to the constituents of f; then use Exercise 3.8. Conversely, given g and h, 
multiply both by a polynomial to get the denominator on the right-hand side 
into the form (1 — z?)¢+1; then use (2.6). 


3.20 Start with the setup on page 75, and closely orient yourself along the 
proof of Theorem 3.8. 


3.29 Use Lemma 3.19. 


Chapter 4 


4.1 Use Exercise 2.1. 
4.2 Use the explicit description of II given by (4.3). 


4.3 Consider each simplicial cone K; separately, and look at the arrangement 
of its bounding hyperplanes. For each hyperplane, use Exercise 3.13. 


4.5 For (a), convince yourself that Q(—t) is also a quasipolynomial. For (b), 
use (1.3). For (c), differentiate (1.3). For (d), think about one constituent of 
the quasipolynomial at a time. 


4.6 In the generating function for Lp(t—k), make a change in the summation 
variable; then use Theorem 4.4. 


4.11 Use the fact that A has only integral entries. For the second part, write 
down the explicit hyperplane descriptions of (t + 1)P° and tP. 
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4.12 Assume that there exist t € Z and a facet hyperplane H of P such that 
there is a lattice point between tH and (t + 1)H. Translate this lattice point 
to a lattice point that violates (4.12). 


Chapter 5 


5.4 Consider an interval |F, P] in the face lattice of P: [F, P] contains all 
faces G such that F C G C P. Prove that if P is simple, any such interval is 
isomorphic to a Boolean lattice. 


5.5 Use Exercise 2.6 to show that the face lattice of a simplex is isomorphic 
to a Boolean lattice. 


Chapter 6 


6.1 Think permutation matrices. 
6.3 Show that the rank of (6.5) is 2n — 1. 


6.5 Start by showing that all permutation matrices are indeed vertices. Then 
use Exercise 6.4 to show that there are no other vertices. 


6.6 Establish a bijection between semimagic squares with line sum t—n and 
semimagic squares with positive entries and line sum t. 


6.7 Think about the smallest possible line sum if the entries of the square 
are positive integers. 


6.8 Follow the computation on page 112 that led to the formula for Hə. 
6.9 Multiply both sides of (6.7) by (w — i) and take the limit as w > a 


6.10 Orient yourself along the computation in (6.10). 


6.16 Compute the matrix equivalent to (6.5) for the polytope describing all 
magic squares of a given size. Show that this matrix has rank 2n + 1. 


6.18 Orient yourself along the computation on page 112. 


Chapter 7 

7.1 Show that both polynomials have the same roots and the same constant 
term. 

7.2 Use Exercise 7.1. 

7.5 Differentiate (1.3). 
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7.6 Use (1.3). 


7.7 Write an arbitrary function on Z with period b in terms of d(x), 1 < 
m <b. 


7.8 Use the definition (7.6) of the inner product and the properties zZ = |z|? 
and (zw) = Z-W for complex numbers z and w. 





7.14 Use the definition (7.4) and simplify the fractional-part function in the 
sum on the right-hand side. 


7.22 Use the definition of F. 


Chapter 8 


8.5 Use Exercise 1.9. 


8.7 Use the methods outlined in the hints for Exercises 1.21 and 2.32 to 
compute the partial fraction coefficients for z = 1 in (8.3). 


8.9 Multiply out all the terms on the left-hand side and make use of Exercises 
1.9 and 7.14. 


8.11 Use the methods outlined in the hints for Exercises 1.21 and 2.32 to 
compute a partial fraction expansion of (8.7). 


Chapter 9 


9.1 Show that (span Fy" N Kz is a cone. Then prove that if H is a defining 
hyperplane for F, then H N (span F i is a hyperplane in the vector space 
(span F ise Finally, show that this hyperplane HN (span F jt defines the apex 


of (span F)* N KF, and that this apex is a point. 


9.2 Consider the hyperplanes Hı, H2,...,Ha41 that bound A. For each hy- 
perplane Hg, denote by Ae the closed half-space bounded by Hy that con- 
tains A, and by H, the open half-space bounded by Hz that does not con- 
tain A. Show that every tangent cone of A is the intersection of some of the 
He ’s, and conversely, that every intersection of some of the H aa ’s, except 
for A = ugi H}, is a tangent cone of A. Since H} UH, = R’ as a dis- 
joint union, for each k, the point x is either in H} or H, . Prove that the 
intersection of those Hy that contain x is the sought-after tangent cone. 


9.4 As in Exercise 5.5, show that the face lattice of a simplex is a Boolean 
lattice. Note that any sublattice of a Boolean lattice is again Boolean. 


9.6 One approach to this problem is first to dilate P and the corresponding 
hyperplanes in H by a small factor. To avoid subtleties, first translate P by 
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an integer vector, if necessary, to ensure that none of the hyperplanes in H 
contains the origin. Use Exercise 3.13. 


9.7 Adjust the steps in Section 9.3 to open polytopes. Start by proving a 
Brianchon—Gram identity for open simplices, by analogy with Theorem 9.5. 
This implies a Brion-type identity for open simplices, as in Corollary 9.6. 
Finally, adjust the proof of Theorem 9.7 to open polytopes. 


Chapter 10 


10.1 Use (10.3), Exercise 2.18, and (2.11). 


10.3 Use the definition of unimodularity to show that the only integer point 
in the fundamental parallelepiped of K is v. 


10.4 Orient yourself along the proof of Theorem 10.4; instead of a sum over 
vertex cones, just consider one simple cone K. 


Chapter 11 


11.3 Multiply out z™ax(z). 


11.4 Orient yourself along the proof of Theorem 4.2. Note that for solid 
angles, we do not require the condition that the boundary of K contains no 
lattice point. 


11.5 As a warm-up exercise, show that 


5 5 ok (Fy (z) = 5 SE oFo(Z). 


aon v a vertex of F v a vertex of A E 
11.6 Start with the setup of our second proof of Ehrhart’s theorem in Sec- 
tion 9.4; that is, it suffices to prove that if p is the denominator of P, then 
Ap(—r — pt) = (-1)%™? Ap(r + pt) for any integers r and t with 0 < r < p 
and t > 0. (Think of r as fixed and t as variable.) Now orient yourself along 
the proof on page 184. 


Chapter 12 
12.1 Bound the integral from above, using the length of C, and an upper 
bound for the absolute value of the integrand. 


12.2 The nontrivial roots of unity are simple poles of f, for which the residue 
computation boils down to a simple limit. 
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12.4 
12.5 
12.6 
12.7 
12.8 
12.9 
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1 
z—(m+ni 


Differentiate (12.1) term by term. 


1 


- into one fraction. 
m+ni 


Start by combining the terms 





j and 


Compute g’ explicitly. 
Use a famous theorem from complex analysis. 
Compute ø (—z) and use the fact that (—(m + in))? = (m + in)?. 


Repeat the proof of Lemma 12.3, but now starting with the proof of 


p' (z +i) = g (2). 


12.10 Use a famous theorem from complex analysis. 


12.11 Use the definition of the Weierstraß go function. 
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